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Pseudo-random graphs 


Michael Krivelevich* Benny Sudakov 1 


1 Introduction 

Random graphs have proven to be one of the most important and fruitful concepts in modern 
Combinatorics and Theoretical Computer Science. Besides being a fascinating study subject for 
their own sake, they serve as essential instruments in proving an enormous number of combinatorial 
statements, making their role quite hard to overestimate. Their tremendous success serves as a 
natural motivation for the following very general and deep informal questions: what are the essential 
properties of random graphs? How can one tell when a given graph behaves like a random graph? 
How to create deterministically graphs that look random-like? This leads us to a concept of pseudo¬ 
random graphs. 

Speaking very informally, a pseudo-random graph G = {V,E) is a graph that behaves like a 
truly random graph G{\V\,p) of the same edge density p = \E\ j . Although the last sentence 
gives some initial idea about this concept, it is not very informative, as first of all it does not say 
in which aspect the pseudo-random graph behavior is similar to that of the corresponding random 
graph, and secondly it does not supply any quantitative measure of this similarity. There are 
quite a few possible graph parameters that can potentially serve for comparing pseudo-random and 
random graphs (and in fact quite a few of them are equivalent in certain, very natural sense, as we 
will see later), but probably the most important characteristics of a truly random graph is its edge 
distribution. We can thus make a significant step forward and say that a pseudo-random graph 
is a graph with edge distribution resembling the one of a truly random graph with the same edge 
density. Still, the quantitative measure of this resemblance remains to be introduced. 

Although first examples and applications of pseudo-random graphs appeared very long time ago, 
it was Andrew Thomason who launched systematic research on this subject with his two papers 
m, iHni in the mid-eighties. Thomason introduced the notion of jumbled graphs, enabling to 
measure in quantitative terms the similarity between the edge distributions of pseudo-random and 
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truly random graphs. He also supplied several examples of pseudo-random graphs and discussed 
many of their properties. Thomason’s papers undoubtedly defined directions of future research for 
many years. 

Another cornerstone contribution belongs to Chung, Graham and Wilson |2bj who in 1989 
showed that many properties of different nature are in certain sense equivalent to the notion of 
pseudo-randomness, defined using the edge distribution. This fundamental result opened many 
new horizons by showing additional facets of pseudo-randomness. 

Last years brought many new and striking results on pseudo-randomness by various researchers. 
There are two clear trends in recent research on pseudo-random graphs. The first is to apply 
very diverse methods from different fields (algebraic, linear algebraic, combinatorial, probabilistic 
etc.) to construct and study pseudo-random graphs. The second and equally encouraging is to 
find applications, in many cases quite surprising, of pseudo-random graphs to problems in Graph 
Theory, Gomputer Science and other disciplines. This mutually enriching interplay has greatly 
contributed to significant progress in research on pseudo-randomness achieved lately. 

The aim of this survey is to provide a systematic treatment of the concept of pseudo-random 
graphs, probably the first since the two seminal contributions of Thomason m. isni. Research 
in pseudo-random graphs has developed tremendously since then, making it impossible to provide 
full coverage of this subject in a single paper. We are thus forced to omit quite a few directions, 
approaches, theorem proofs from our discussion. Nevertheless we will attempt to provide the reader 
with a rather detailed and illustrative account of the current state of research in pseudo-random 
graphs. 

Although, as we will discuss later, there are several possible formal approaches to pseudo¬ 
randomness, we will mostly emphasize the approach based on graph eigenvalues. We find this 
approach, combining linear algebraic and combinatorial tools in a very elegant way, probably the 
most appealing, convenient and yet quite powerful. 

This survey is structured as follows. In the next section we will discuss various formal defini¬ 
tions of the notion of pseudo-randomness, from the so called jumbled graphs of Thomason to the 
(n, d, A)-graphs defined by Alon, where pseudo-randomness is connected to the eigenvalue gap. We 
then describe several known constructions of pseudo-random graphs, serving both as illustrative 
examples for the notion of pseudo-randomness, and also as test cases for many of the theorems 
to be presented afterwards. The strength of every abstract concept is best tested by properties it 
enables to derive. Pseudo-random graphs are certainly not an exception here, so in Section 4 we 
discuss various properties of pseudo-random graphs. Section 5, the final section of the paper, is 
devoted to concluding remarks. 

2 Definitions of pseudo-random graphs 

Pseudo-random graphs are much more of a general concept describing some graph theoretic phe¬ 
nomenon than of a rigid well defined notion - the fact reflected already in the plural form of the title 
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of this section! Here we describe various formal approaches to the concept of pseudo-randomness. 
We start with stating known facts on the edge distribution of random graphs, that will serve later as 
a benchmark for all other definitions. Then we discuss the notion of jumbled graphs introduced by 
Thomason in the mid-eighties. Then we pass on to the discussion of graph properties, equivalent in 
a weak (qualitative) sense to the pseudo-random edge distribution, as revealed by Chung, Graham 
and Wilson in m- Our next item in this section is the definition of pseudo-randomness based on 
graph eigenvalues - the approach most frequently used in this survey. Finally, we discuss the related 
notion of strongly regular graphs, their eigenvalues and their relation to pseudo-randomness. 


2.1 Random graphs 

As we have already indicated in the Introduction, pseudo-random graphs are modeled after truly 
random graphs, and therefore mastering the edge distribution in random graphs can provide the 
most useful insight on what can be expected from pseudo-random graphs. The aim of this subsection 
is to state all necessary dehnitions and results on random graphs. We certainly do not intend to be 
comprehensive here, instead referring the reader to two monographs on random graphs m, m. 
devoted entirely to the subject and presenting a very detailed picture of the current research in this 
area. 

A random graph G{n,p) is a probability space of all labeled graphs on n vertices {!,... ,n}, 
where for each pair I < i < j < n, {i,j) is an edge of G{n,p) with probability p = p{n), indepen¬ 
dently of any other edges. Equivalently, the probability of a graph G = {V, E) with V = {1,... , n} 
in G{n,p) is Pr[G] = — p)( 2 ) "We will occasionally mention also the probabil¬ 

ity space Gn,d, this is the probability space of all d-regular graphs on n vertices endowed with 
the uniform measure, see the survey of Wormald IHSI for more background. We also say that a 
graph property A holds almost surely, or a.s. for brevity, in G{n,p) {Gn,d) if the probability that 
G{n,p) {Gn,d) has A tends to one as the number of vertices n tends to infinity. 

From our point of view the most important parameter of random graph G{n,p) is its edge 
distribution. This characteristics can be easily handled due to the fact that G{n,p) is a product 
probability space with independent appearances of different edges. Below we cite known results on 
the edge distribution in G{n,p). 


Theorem 2.1 Let p 

u vertices, then 


p(n) < 0.99. Then almost surely G G G{n,p) is such that ifU is any set of 


e{U) -p 


u 


= O log^/^(2n/ti)^ . 


Theorem 2.2 Let p = p{n) < 0.99. Then almost surely G G G{n,p) is such that if U,W are 
disjoint sets of vertices satisfying u = \U\ < w = \W\, then 

\e{U, W) — puw\ = O log^/^(2n/t(;)^ . 
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The proof of the above two statements is rather straightforward. Notice that both quantities 
e{U) and e{U, W) are binomially distributed random variables with parameters ( 2 ) and p, and uw 
and p, respectively. Applying standard Chernoff-type estimates on the tails of the binomial distri¬ 
bution (see, e.g., Appendix A of CHI) and then the union bound, one gets the desired inequalities. 

It is very instructive to notice that we get less and less control over the edge distribution as the 
set size becomes smaller. For example, in the probability space G{n, 1/2) every subset is expected 
to contain half of its potential edges. While this is what happens almost surely for large enough sets 
due to Theorem EH there will be almost surely sets of size about 2 log 2 n containing all possible 
edges (i.e. cliques), and there will be almost surely sets of about the same size, containing no edges 
at all (i.e. independent sets). 

For future comparison we formulate the above two theorems in the following unified form: 

Corollary 2.3 Let p = p{n) < 0.99. Then almost surely in G{n,p) for every two (not necessarily) 
disjoint subsets of vertices U,W C V of cardinalities \U\ = u, |1F| = w, the number e{U,W) of 
edges of G with one endpoint in U and the other one in W satisfies: 

\e{U, W) — puw\ = 0{^uwnp) . (1) 

(A notational agreement here and later in the paper: if an edge e belongs to the intersection t/nW, 
then e is counted twice in e([/, IT).) 

Similar bounds for edge distribution hold also in the space Gn^ of d-regular graphs, although 
they are significantly harder to derive there. 

Inequality o provides us with a quantitative benchmark, according to which we will later 
measure the uniformity of edge distribution in pseudo-random graphs on n vertices with edge 
density p = \E{G)\ /(”) 

It is interesting to draw comparisons between research in random graphs and in pseudo-random 
graphs. In general, many properties of random graphs are much easier to study than the corre¬ 
sponding properties of pseudo-random graphs, mainly due to the fact that along with the almost 
uniform edge distribution described in Corollary 12.31 random graphs possess as well many other 
nice features, first and foremost of them being that they are in fact very simply defined product 
probability spaces. Certain graph properties can be easily shown to hold almost surely in G{n,p) 
while they are not necessarily valid in pseudo-random graphs of the same edge density. We will 
see quite a few such examples in the next section. A general line of research appears to be not 
to use pseudo-random methods to get new results for random graphs, but rather to try to adapt 
techniques developed for random graphs to the case of pseudo-random graphs, or alternatively to 
develop original techniques and methods. 

2.2 Thomason’s jumbled graphs 

In two fundamental papers m, iHoi published in 1987 Andrew Thomason introduced the first 
formal quantitative definition of pseudo-random graphs. It appears quite safe to attribute the 
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launch of the systematic study of pseudo-randomness to Thomason’s papers. 

Thomason used the term ’’jumbled” graphs in his papers. A graph G = (V, E) is said to be 
(p, a)-jumbled if p, a are real numbers satisfying 0<p<l<a!if every subset of vertices U C V 
satishes: 


e{U) -p 



< a\U\ . 


( 2 ) 


The parameter p can be thought of as the density of G, while a controls the deviation from the 
ideal distribution. According to Thomason, the word ’’jumbled” is intended to convey the fact that 
the edges are evenly spread throughout the graph. 

The motivation for the above definition can be clearly traced to the attempt to compare the edge 
distribution in a graph G to that of a truly random graph G{n,p). Applying it indeed to G{n,p) 
and recalling m we conclude that the random graph G{n,p) is almost surely 0(y^np)-jumbled. 

Thomason’s definition has several trivial yet very nice features. Observe for example that if G 
is (p, a)-jumbled then the complement G is (1 — p, aj-jumbled. Also, the definition is hereditary - 
if G is (p, a)-jumbled, then so is every induced subgraph H of G. 

Note that being (p, 0(np))-jumbled for a graph G on n vertices and (J^'jp edges does not say 
too much about the edge distribution of G as the number of edges in linear sized sets can deviate 
by a percentage from their expected value. However as we shall see very soon if G is known to 
be (p, o(np))-jumbled, quite a lot can be said about its properties. Of course, the smaller is the 
value of a, the more uniform or jumbled is the edge distribution of G. A natural question is then 
how small can be the parameter a = a(n,p) for a graph G = (V,E) on |H| = n vertices with edge 
density p = \E\ j ( 2 ) ? Erdos and Spencer proved in inn] that a satishes a = Q{^/n) for a constant 
p; their method can be extended to show a = Q{y/np) for all values of p = p(n). We thus may 
think about (p, 0(y^rip))-jumbled graphs on n vertices as in a sense best possible pseudo-random 
graphs. 

Although the fact that G is (p, a)-jumbled carries in it a lot of diverse information on the 
graph, it says almost nothing (directly at least) about small subgraphs, i.e. those spanned by 
subsets U of size \U\ = o{a/p). Therefore in principle a (p, a)-jumbled graph can have subsets of 
size \U\ = 0{alp) spanning by a constant factor less or more edges then predicted by the uniform 
distribution. In many cases however quite a meaningful local information (such as the presence of 
subgraphs of hxed size) can still be salvaged from global considerations as we will see later. 

Condition 0 has obviously a global nature as it applies to all subsets of G, and there are 
exponentially many of them. Therefore the following result of Thomason, providing a sufficient 
condition for pseudo-randomness based on degrees and co-degrees only, carries a certain element of 
surprise in it. 


Theorem 2.4 UEI Let G be a graph on n vertices with minimum degree np. If no pair of vertices 
of G has more than np^ + I common neighbors, then G is (p, ^{p + l)n)-jumbled. 

The above theorem shows how the pseudo-randomness condition of 0 can be ensured/checked by 
testing only a polynomial number of easily accessible conditions. It is very useful for showing that 
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specific constructions are jumbled. Also, it can find algorithmic applications, for example, a very 
similar approach has been used by Alon, Duke, Lefmann, Rodl and Yuster in their Algorithmic 
Regularity Lemma [HI. 

As observed by Thomason, the minimum degree condition of Theorem 12.41 can be dropped if 
we require that every pair of vertices has (1 + o(l))np^ common neighbors. One cannot however 
weaken the conditions of the theorem so as to only require that every edge is in at most np^ + I 
triangles. 

Another sufficient condition for pseudo-randomness, this time of global nature, has also been 
provided in m, iHni: 

Theorem 2.5 UB Let G be a graph of order n, let rjn be an integer between 2 and n — 2, and let 
u > 1 be a real number. Suppose that each induced subgraph H of order gn satisfies \e{H)—p(J'^) \ < 
gna. Then G is {p, 7y/najgl{l — g))-jumbled. Moreover G contains a subset U C V{G) of size 
\U\ > (l — ^ such that the induced subgraph G[U] is {p,uja)-jumbled. 

Thomason also describes in m. Eoi several properties of jumbled graphs. We will not discuss 
these results in details here as we will mostly adopt a different approach to pseudo-randomness. 
Occasionally however we will compare some of later results to those obtained by Thomason. 


2.3 Equivalent definitions of weak pseudo-randomness 


Let us go back to the jumbledness condition m of Thomason. As we have already noted it becomes 
non-trivial only when the error term in m is o{n^p). Thus the latter condition can be considered 
as the weakest possible condition for pseudo-randomness. 

Guided by the above observation we now define the notion of weak pseudo-randomness as 
follows. Let {Gn) be a sequence of graphs, where Gn has n vertices. Let also p = p{n) is a 
parameter (p(n) is a typical density of graphs in the sequence). We say that the sequence {Gn) is 
weakly pseudo-random if the following condition holds: 


For all subsets U C V{Gn), 


e{U)-p 



o{n^p) . 


( 3 ) 


For notational convenience we will frequently write G = tacitly assuming that (G) is in fact a 
sequence of graphs. 

Notice that the error term in the above condition of weak pseudo-randomness does not depend 
on the size of the subset U. Therefore it applies essentially only to subsets U of linear size, ignoring 
subsets U of size o(n). Hence m is potentially much weaker than Thomason’s jumbledness condition 

©• 

Corollary 12.31 supplies us with the first example of weakly pseudo-random graphs - a random 
graph G{n,p) is weakly pseudo-random as long as p{n) satisfies np —> oo. We can thus say that if 
a graph G on n vertices is weakly pseudo-random for a parameter p, then the edge distribution of 
G is close to that of G{n,p). 
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In the previous subsection we have already seen examples of conditions implying pseudo¬ 
randomness. In general one can expect that conditions of various kinds that hold almost surely in 
G{n,p) may imply or be equivalent to weak pseudo-randomness of graphs with edge density p. 

Let us first consider the case of the constant edge density p. This case has been treated exten¬ 
sively in the celebrated paper of Chung, Graham and Wilson from 1989 m, where they formulated 
several equivalent conditions for weak pseudo-randomness. In order to state their important result 
we need to introduce some notation. 

Let G = {V,E) be a graph on n vertices. For a graph L we denote by Nq{L) the number of 
labeled induced copies of L in G, and by Ng{L) the number of labeled not necessarily induced 
copies of L in G. For a pair of vertices x,y € I^(G), we set s{x,y) to be the number of vertices of 
G joined to x and y the same way: either to both or to none. Also, codeg{x, y) is the number of 
common neighbors of x and y in G. Finally, we order the eigenvalues Aj of the adjacency matrix 
A{G) so that I All > IA 2 I > ... > |A„|. 

Theorem 2.6 Iil 6 ) / Let p G (0,1) be fixed. For any graph sequence {Gn) the following properties 
are equivalent: 

Pi{l)- For a fixed I > 4 for all graphs L on I vertices, 

Nq{L) = {1 + o{l))n’‘p'^^^^\l — . 

P 2 {t)' Let Gt denote the cycle of length t. Let t > A be even, 

2 

e(Gn) = ^ + o(n^) and NciGt) < {npf + o{n^) . 

P 3 : e{Gn) > 'Af + o{n^) and Xi = {I + o{l))np, A 2 = o(n) . 

P 4 : For each subset Lf C V{G), e{U) = -|-o(n^) . 

P 5 : For each subset U C V{G) with \U\ = [^J, we have e{U) = (| + o(l)) ■ 

PG Y.x,y^v y) - (p‘^ + (1 - • 

Pp- I 2 x,yGV\codeg{x,y) -p'^nl = o{n^) . 

Note that condition P 4 of this remarkable theorem is in fact identical to our condition Q of 
weak pseudo-randomness. Thus according to the theorem all conditions P 1 -P 3 , P 5 — Pj are in fact 
equivalent to weak pseudo-randomness! 

As noted by Chung et al. probably the most surprising fact (although possibly less surprising 

for the reader in view of Theorem l2.4l) is that apparently the weak condition ^ 2 ( 4 ) is strong enough 

to imply weak pseudo-randomness. 

It is quite easy to add another condition to the equivalence list of the above theorem: for all 
U,W CV, e{U,W) = p\U\\W\ + o{n^). 
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A condition of a very different type, related to the celebrated Szemeredi Regularity Lemma 
has been added to the above list by Simonovits and Sos in m They showed that if a graph G 
possesses a Szemeredi partition in which almost all pairs have density p, then G is weakly pseudo¬ 
random, and conversely if G is weakly pseudo-random then in every Szemeredi partition all pairs are 
regular with density p. An extensive background on the Szemeredi Regularity Lemma, containing 
in particular the definitions of the above used notions, can be found in a survey paper of Komlos 
and Simonovits m- 

The reader may have gotten the feeling that basically every property of random graphs G{n,p) 
ensures weak pseudo-randomness. This feeling is quite misleading, and one should be careful while 
formulating properties equivalent to pseudo-randomness. Here is an example provided by Chung 
et al. Let G be a graph with vertex set 4n} defined as follows: the subgraph of G spanned 

by the first 2n vertices is a complete bipartite graph Kn^n, the subgraph spanned by the last 2 n 
vertices is the complement of Kn^n, and for every pair (i, j), 1 < i < 2re, 2n + 1 < j < 4n, the edge 
(i,j) is present in G independently with probability 0.5. Then G is almost surely a graph on 4n 
vertices with edge density 0.5. One can verify that G has properties ^ 1 ( 3 ) and T 2 ( 2 t -|- 1) for every 
t > 1, but is obviously very far from being pseudo-random (contains a clique and an independent 
set of one quarter of its size). Hence Ti(3) and P2{2,t + 1) are not pseudo-random properties. This 
example shows also the real difference between even and odd cycles in this context - recall that 
Property P2{‘2t) does imply pseudo-randomness. 

A possible explanation to the above described somewhat disturbing phenomenon has been 
suggested by Simonovits and Sos in m They noticed that the above discussed properties are 
not hereditary in the sense that the fact that the whole graph G possesses one of these properties 
does not imply that large induced subgraphs of G also have it. A property is called hereditary in 
this context if it is assumed to hold for all sufficiently large subgraphs F of our graph G with the 
same error term as for G. Simonovits and Sos proved that adding this hereditary condition gives 
significant extra strength to many properties making them pseudo-random. 

Theorem 2.7 Let L be a fixed graph on I vertices, and let p G (0,1) he fixed. Let (Gn) be a 
sequence of graphs. If for every induced subgraph H P G on h vertices, 

Nh{L) = -|- o(n^), 

then {Gn) is weakly pseudo-random, i.e. property P 4 holds. 

Two main distinctive features of the last result compared to Theorem 12 .bl are: (a) Ti(3) assumed 
hereditarily implies pseudo-randomness; and (b) requiring the right number of copies of a single 
graph L on I vertices is enough, compared to Condition Pi{l) required to hold for all graphs on I 
vertices simultaneously. 

Let us switch now to the case of vanishing edge density p{n) = o(l). This case has been treated 
in two very recent papers of Chung and Graham HH] and of Kohayakawa, Rodl and Sissokho isni. 
Here the picture becomes significantly more complicated compared to the dense case. In particular. 


there exist graphs with very balanced edge distribution not containing a single copy of some fixed 
subgraphs (see the Erdos-Renyi graph and the Alon graph in the next section (Examples 6 , 9, 
resp.)). 

In an attempt to find properties equivalent to weak pseudo-randomness in the sparse case, 
Chung and Graham define the following properties in m ■■ 

CIRCUIT(t): The number of closed walks 'Wo,wi,... ,wt = wq of length t in G is (1 -|- o(l))(np)*; 
CYCLE(t): The number of labeled f-cycles in G is (1 -|- o(l))(np)*; 

EIG: The eigenvalues Aj, |Ai| > IA 2 I > ... |An|, of the adjacency matrix of G satisfy: 

Ai = (l-ho(l))np, 

I Ail = o{np),i>l. 


DISC: For all X,Y C R(G), 

\e{X,Y)-p\X\\Y\\=o{pn^) . 

(DISC here is in fact DICS(l) in [^1. 

Theorem 2.8 \‘2,^ Let (G = Gn '■ n —> 00) he a sequence of graphs with e(G„) = (1 -|- o(l))p( 2 ). 
Then the following implications hold for all t > 1: 

CIRCUIT{2t) => EIG => DISG . 

Proof. To prove the first implication, let A be the adjacency matrix of G, and consider the 
trace Tr{A^^). The (i,i)-entry of is equal to the number of closed walks of length 2t starting 
and ending at i, and hence Tr{A^^) = (1 -|- o(l))(np)^^ On the other hand, since A is symmetric it 
is similar to the diagonal matrix D = diag{Xi, A 2 , • • •, A„), and therefore Tr{A^^) = 
obtain: 

n 

^Af = (1 + o(l))(np)2* . 

i=l 

Since the first eigenvalue of G is easily shown to be as large as its average degree, it follows that 
Ai > 2|£'(G)|/|R(G)| = (1 -I- o{l))np. Combining these two facts we derive that Ai = (1 -|- o(l))np 
and |Ai| = o{np) as required. 

The second implication will be proven in the next subsection. □ 

Both reverse implications are false in general. To see why DISG 7 ^ EIG take a graph Go on 
n — 1 vertices with all degrees equal to (1 -|- o(l))n°'^ and having property DISG (see next section 
for examples of such graphs). Now add to Gq a vertex v* and connect it to any set of size n®'® in 
Go, let G be the obtained graph. Since G is obtained from Gq by adding o(|£'(Go|) edges, G still 
satisfies DISG. On the other hand, G contains a star S of size with a center at v*, and hence 
Ai(G) > Ai(S') = — 1 3> |£'(G)|/n (see, e.g. Chapter 11 of [OH for the relevant proofs). This 

solves an open question from |25] . 
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The Erdos-Renyi graph from the next section is easily seen to satisfy EIG, but fails to satisfy 
CIRCUIT{4:). Chung and Graham provide an alternative example in (Example 1). 

The above discussion indicates that one probably needs to impose some additional condition on 
the graph G to glue all these pieces together and to make the above stated properties equivalent. 
One such condition has been suggested by Chung and Graham who defined; 

U(t): Eor some absolute constant c, all degrees in G satisfy: d{v) < cnp, and for every pair of 

vertices x,y G G the number et-i{x,y) of walks of length t — 1 from x to y satisfies: et-i{x,y) < 
/—2 

cn p . 

Notice that U{t) can only hold for p > where d depends on c. Also, every dense 

graph {p = 0(1)) satisfies U{t). 

As it turns out adding property U(t) makes all the above defined properties equivalent and thus 
equivalent to the notion of weak pseudo-randomness (that can be identified with property DISG): 

Theorem 2.9 Suppose for some constant c > 0, p{n) > where t > 2. For any 

family of graphs Gn, \E{Gn)\ = (1 -|- o(l))p( 2 ), satisfying U{t), the following properties are all 
equivalent: GIRCUIT{2t),GYGLE{2t), EIG and DISG. 

Theorem 12.91 can be viewed as a sparse analog of Theorem 12.hi as it also provides a list of 
conditions equivalent to weak pseudo-randomness. 

Eurther properties implying or equivalent to pseudo-randomness, including local statistics con¬ 
ditions, are given in m- 

2.4 Eigenvalues and pseudo-random graphs 

In this subsection we describe an approach to pseudo-randomness based on graph eigenvalues - 
the approach most frequently used in this survey. Although the eigenvalue-based condition is not 
as general as the jumbledness condition of Thomason or some other properties described in the 
previous subsection, its power and convenience are so appealing that they certainly constitute 
a good enough reason to prefer this approach. Below we first provide a necessary background 
on graph spectra and then derive quantitative estimates connecting the eigenvalue gap and edge 
distribution. 

Recall that the adjacency matrix of a graph G = {V, E) with vertex set E = {1,... , n} is an n-by- 
n matrix whose entry Uij is 1 if (f, j) G E{G), and is 0 otherwise. Thus A is a 0,1 symmetric matrix 
with zeroes along the main diagonal, and we can apply the standard machinery of eigenvalues and 
eigenvectors of real symmetric matrices. It follows that all eigenvalues of A (usually also called the 
eigenvalues of the graph G itself) are real, and we denote them by Ai > A 2 > ... > Xn- Also, there 
is an orthonormal basis B = {xi, ... ,Xn} of the euclidean space R^ composed of eigenvectors of 
A: Axi = AjXj, x\xi = I,f = l,...,n. The matrix A can be decomposed then as: A = 

- the so called spectral decomposition of A. (Notice that the product xx*, x G R", is an n-by-n 
matrix of rank 1; if x,y,z G R” then y*(xx*)z = (?/*x)(x*z)). Every vector y G R^ can be easily 
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represented in basis B: y = Therefore, for y,z G R^, y^z = 

\\y\\^ = y'y = n=M^^)"■ 

All the above applies in fact to all real symmetric matrices. Since the adjacency matrix A of 
a graph G is a matrix with non-negative entries, one can derive some important extra features of 
A, most notably the Perron-Frobenius Theorem, that reads in the graph context as follows; if G 
is connected then the multiplicity of Ai is one, all coordinates of the first eigenvector xi can be 
assumed to be strictly positive, and |Aj| < Ai for all i > 2. Thus, graph spectrum lies entirely in 
the interval [—Ai,Ai]. 

For the most important special case of regular graphs Perron-Frobenius implies the following 
corollary: 

Proposition 2.10 Let G be a d-regular graph on n vertices. Let Xi > X 2 >■■■> Xn be the 
eigenvalues of G. Then Xi = d and —d < Xi < d for all 1 < i < n. Moreover, if G is connected 
then the first eigenvector xi is proportional to the all one vector (1,..., 1 )* G and Xi < d for 
all i >2. 


To derive the above claim from the Perron-Frobenius Theorem observe that e = (1,... , 1) is 
immediately seen to be an eigenvector of A(G) corresponding to the eigenvalue d: Ae = de. The 
positivity of the coordinates of e implies then that e is not orthogonal to the first eigenvector, and 
hence is in fact proportional to xi of A{G). Proposition 12.101 can be also proved directly without 
relying on the Perron-Frobenius Theorem. 

We remark that Xn = —d is possible, in fact it holds if and only if the graph G is bipartite. 

All this background information, presented above in a somewhat condensed form, can be found 
in many textbooks in Linear Algebra. Readers more inclined to consult combinatorial books can 
find it for example in a recent monograph of Godsil and Royle on Algebraic Graph Theory m- 

We now prove a well known theorem (see its variant, e.g., in Ghapter 9, m) bridging between 
graph spectra and edge distribution. 


Theorem 2.11 Let G be a d-regular graph on n vertices. Let d = Xi > X 2 > ■.. Xn be the 
eigenvalues of G. Denote 

X — lTiaX2<i<n\Xi\ . 

Then for every two subsets Lf,W C P, 


e{U, W) 


d\U\\W\ 


n 


<xJ\u\\w\{i 


n J \ n J 


( 4 ) 


Proof. Let B = {xi,... ,Xn} be an orthonormal basis of R^ composed from eigenvectors of A: 
Axi = XiXi, 1 < i < n. We represent A = XiXixj. Denote 


Ai = Aixix^ , 

n 

T = ^ XiXix\ , 

i=2 
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then A = Ai -h £. 

Let u = \U\, w = \ W\ be the cardinalities of U,W, respectively. We denote the characteristic 
vector of U by xu £ be. Xui'i) = 1 if i £ C/, and xu{i) = 0 otherwise. Similarly, let xw £ b?” 
be the characteristic vector of W. We represent xUj Xw according to B: 


XU 

Xw 


n n 



— XljXi-, 

E“? ^ llxr/P 

i=l 


i=l 

TL 

'^PiXi, 

i=l 

Pi = XwXi, 

E = llxivll' 

i=l 


It follows easily from the definitions of A, xu and Xw that the product Xu^Xw counts exactly the 
number of edges of G with one endpoint in U and the other one in W, i.e. 


e{U,W) — Xu^Xw — Xu^iXw + Xu^Xw ■ 


Now we estimate the last two summands separately, the first of them will be the main term for 
e{U,W), the second one will be the error term. Substituting the expressions for xu-, Xw and 
recalling the orthonormality of i?, we get: 


XiJ^iXW 



(Aixixi) 






n n 

EE aiXiPj{xlxi){x\xj) = ai(5i\i . ( 5 ) 

i=l j=l 


Similarly, 

( n I ^ \/^ \ ^ 

^ OiXi I ^ >^jXjX^j I ^ (5kXk I = ^ aifSiXi . (6) 

i=l ) \j=2 j \fc=l / i=2 

Recall now that G is d-regular. Then according to Proposition 12.101 Ai = d and xi = 
-^(1,..., 1)*. We thus get: cci = XxjXi = uj^/n and f5i = Xvu^i = wj^fn. Hence it follows 
from © that Xu-^iXw = duw/n. 

Now we estimate the absolute value of the error term Xu^Xw- Recalling 0, the definition of 
A and the obtained values of ai, Pi, we derive, applying Cauchy-Schwartz: 


\Xu^Xw\ 


n 


n 


n n 


^ ^ OiiPiXi I E 

i=2 


X\'^aiPi\ < A, 

i=2 \ 




A\/(||xt/P -a?)(llxwP -/3f) 




The theorem follows. 


□ 


The above proof can be extended to the irregular (general) case. Since the obtained quantitative 
bounds on edge distribution turn out to be somewhat cumbersome, we will just indicate how they 
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can be obtained. Let G = {V, E) be a graph on n vertices with average degree d. Assume that the 
eigenvalues of G satisfy X < d, with A as defined in the theorem. Denote 


K= '^{d{v)-df . 

v£V 

The parameter AT is a measure of irregularity of G. Clearly AT = 0 if and only if G is d-regular. 
Let e = -^(1,..., l)h We represent e in the basis B = {xi,..., Xn} of the eigenvectors of A{G): 


n n 

e = '^liXi, -fi = e^Xi, ^7f = ||ef = l. 
i=l i=l 


Denote z = -^{d{vi) — d ,... , d{vn) — dY, then \\z\Y = K/n. Notice that Ae = ■^{d{vi ),..., d{vn)Y 
= de + z, and therefore z = Ae — de = ~ d)xi. This implies: 

n n 

i=l i=2 

n 

> {d-xYY.^j- 

i=2 

Hence ^”^2 7? < n{d-xy^ • follows that 7i = 1 - EL 2 > 1 “ n{d-xy^ ^nd 


71 > 7 ? > 1 


K 

n{d — \Y 


Now we estimate the distance between the vectors e and xi and show that they are close given that 
the parameter AT is small. 


||e — xill^ = (e — xi)*(e — xi) = e*e + x\xi — 2 e*xi = 1 + 1 — 271 = 2 — 271 
2K 

~ n{d — Xy 

We now return to expressions © and m from the proof of Theorem 12.111 In order to estimate 
the main term we bound the coefficients ai, /3i and Ai as follows: 


u 


ai = Xuxi = Xf/e + Xuixi - e) = ^ + Xui^i - e) , 

\/n 


and therefore 


ai 


= \Xv{xi - e)| < WxuW ■ Iki - e|| < 


2Ku 


d-X 


In a similar way one gets: 


n 


< 


2 X 11 } 


d-X 


(7) 

( 8 ) 
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Finally, to estimate from above the absolute value of the difference between Ai and d we argue as 
follows: 

— = Ikf = - df > 7i(Ai -d)^ 

2=1 


and therefore 


,, ,, \ [K nid-Xf Ik 

\Xi-d\<-\ -< ^ ’ -I 


(9) 


7 i V n n{d — Xy — K \ n 
Summarizing, we see from 0 , 0 and 0 that the main term in the product XijAiXw is equal to 
just as in the regular case, and the error term is governed by the parameter K. 

In order to estimate the error term Xu^Xw we use 0 to get: 


\xv£xw\ = 

n 

^ ^ Of-iPiXi 

< X 

n 


n n 

E7E7 


i=2 

n 

n 

i=2 

\ 

i=2 1=2 


< A, 


\ =^\\xu\\\\xw\\=X./^. 

\ i=l i=l 


□ 


Applying the above developed techniques we can prove now the second implication of Theorem 
EHl Let us prove first that EIG implies K = o{nd‘^), where d = (1 + o{l))np is as before the 
average degree of G. Indeed, for every vector v G -R” we have ||Au|| < Ai||u||, and therefore 

Xfn = Xfe^e > (Ae)*(Ae) = ^ d^(v) . 

vev 

Hence from EIG we get: Ylvev d?{v) < (1 + o{l))nd'^. As Ylv di^) = nd, it follows that: 

K = ^(d(u) — d)^ = ^ d^{v) — 2d ^ d{v) + nd^ = (1 + o(l))nd^ — 2nd^ + nd^ = o{ndF ), 

V^V V^V V^V 

as promised. Substituting this into estimates 0, ®, ® and using A = o(d) of EIG we get: 

u 


ot\ — 


/n 

w 


+ o{-s/u ), 


/?! = ^ + o{y/w) , 

vn 


and therefore 


Ai — (1 + o(l))d, 


f A duw , , . 

XuAiXw = -h o(dn) 

n 


Also, according to EIG, A = o(d), which implies: 

Xv^Xw = o{dy/ww) = o{dn ), 


and the claim follows. 


□ 


14 












Theorem l2.11l is a truly remarkable result. Not only it connects between two seemingly unrelated 
graph characteristics - edge distribution and spectrum, it also provides a very good quantitative 
handle for the uniformity of edge distribution, based on easily computable, both theoretically and 
practically, graph parameters ~ graph eigenvalues. According to the bound a polynomial 
number of parameters can control quite well the number of edges in exponentially many subsets of 
vertices. 

The parameter A in the formulation of Theorem 12 .Ill is usually called the second eigenvalue of 
the d-regular graph G (the first and the trivial one being Ai = d). There is certain inaccuracy 
though in this term, as in fact A = max{A 2 , —An}. Later we will call, following Alon, a d-regular 
graph G on n vertices in which all eigenvalues, but the first one, are at most A in their absolute 
values, an {n,d, X)-graph. 

Comparing @ with the definition of jumbled graphs by Thomason we see that an (n, d, A)- 
graph G is (d/n, A)-jumbled. Hence the parameter A (or in other words, the so called spectral gap 
- the difference between d and A) is responsible for pseudo-random properties of such a graph. 
The smaller the value of A compared to d, the more close is the edge distribution of G to the ideal 
uniform distribution. A natural question is then: how small can be A? It is easy to see that as long 
as d < (1 — e)n, A = H(\/d). Indeed, the trace of satisfies: 

n 

nd = 2\E{G)\ = Tr{A^) = ^ A^ < d^ + (n - IjAa < (1 - e)nd +{n- 1)\^ , 

i=l 

and A = H(-v/d) as claimed. More accurate bounds are known for smaller values of d (see, e.g. m)- 
Based on these estimates we can say that an (n, d, A)-graph G, for which A = &{Vd), is a very 
good pseudo-random graph. We will see several examples of such graphs in the next section. 

2.5 Strongly regular graphs 

A strongly regular graph srg{n, d, tj, p) is a d-regular graph on n vertices in which every pair of 
adjacent vertices has exactly rj common neighbors and every pair of non-adjacent vertices has 
exactly /r common neighbors. (We changed the very standard notation in the above definition so 
as to avoid interference with other notational conventions throughout this paper and to make it 
more coherent, usually the parameters are denoted (u, fc. A,/r)). Two simple examples of strongly 
regular graph are the pentagon G 5 that has parameters (5,2,0,1), and the Petersen graph whose 
parameters are (10,3, 0,1). Strongly regular graphs were introduced by Bose in 1963 [21] who also 
pointed out their tight connections with finite geometries. As follows from the definition, strongly 
regular graphs are highly regular structures, and one can safely predict that algebraic methods 
are extremely useful in their study. We do not intend to provide any systematic coverage of this 
fascinating concept here, addressing the reader to the vast literature on the subject instead (see, 
s-g-j |^4j i. Our aim here is to calculate the eigenvalues of strongly regular graphs and then to 
connect them with pseudo-randomness, relying on results from the previous subsection. 
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Proposition 2.12 Let G be a eonnected strongly regular graph with parameters {n,d,r], p,). Then 
the eigenvalues of G are: Xi = d with multiplicity si = 1, 


^2 = ^ (r? - /X + ^/{ri - + 4{d - 

and 

h = ^{y- L- Vih- + p)^ , 

with multiplicities 

if {n - l){p - p) - 2d 

S 2 = — 1 H- , ^ 

2 - ??)2 + 4(d - ^) 

and 

s,= '^(n-l- in-l)ip-v)-2 d \ 

2 y - ??)2 + 4(d -//) y ’ 

respectively. 

Proof. Let A be the adjacency matrix of A. By the definition of A and the fact that A is 
symmetric with zeroes on the main diagonal, the (i,j)-entry of the square counts the number 
of common neighbors of Vi and Vj in G if i ^ j, and is equal to the degree d{vi) in case i = j. The 
statement that G is srg{n, d, p, p) is equivalent then to; 

AJ = dJ, A = {d — p)I + pJ + {p — p)A , (10) 

where J is the n-by-n all-one matrix and I is the n-by-n identity matrix. 

Since G is d-regular and connected, we obtain from the Perron-Frobenius Theorem that \i = d 
is an eigenvalue of G with multiplicity 1 and with e = (1,..., 1)* as the corresponding eigenvector. 
Let A 7 ^ d be another eigenvalue of G, and let x G be a corresponding eigenvector. Then x is 
orthogonal to e, and therefore Jx = 0. Applying both sides of the second identity in GDI) to x we 
get the equation: = (d — p)x + {p — p)Xx, which results in the following quadratic equation for 

A: 

A^ + (/i — p)X {p — d) = 0 . 

This equation has two solutions A 2 and A 3 as defined in the proposition formulation. If we denote 
by S 2 and S 3 the respective multiplicities of A 2 and A 3 as eigenvalues of A, we get: 

1 + S 2 + S 3 = n, Tr(A) = d + S 2 A 2 + S 3 A 3 = 0 . 

Solving the above system of linear equations for S 2 and S 3 we obtain the assertion of the proposition. 

□ 

Using the bound @ we can derive from the above proposition that if the parameters of a 
strongly regular graph G satisfy p ^ p then G has a large eigenvalue gap and is therefore a good 
pseudo-random graph. We will exhibit several examples of such graphs in the next section. 
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3 Examples 


Here we present some examples of pseudo-random graphs. Many of them are well known and 
already appeared, e.g., in m and m, but there also some which have been discovered only 
recently. Since in the rest of the paper we will mostly discuss properties of (n, d, A)-graphs, in our 
examples we emphasize the spectral properties of the constructed graphs. We will also use most of 
these constructions later to illustrate particular points and to test the strength of the theorems. 
Random graphs 

1. Let G = G{n,p) be a random graph with edge probability p. If p satisfies pn/logn oo 
and (1 — p)nlogn ^ oo, then almost surely all the degrees of G are equal to (1 -|- o{l))np. 
Moreover it was proved by Fiiredi and Komlos M that the largest eigenvalue of G is a.s. 
(l-|-o(l))np and that A(G) < (2 -|- o(l))y^p(l — p)n. They stated this result only for constant 
p but their proof shows that A(G') < 0{,ynp) also when p > polylogn/n. 

2. For a positive integer-valued function d = d{n) we define the model Gn^ of random regular 
graphs consisting of all regular graphs on n vertices of degree d with the uniform probability 
distribution. This definition of a random regular graph is conceptually simple, but it is not 
easy to use. Fortunately, for small d there is an efficient way to generate Gn,d which is useful 
for theoretical studies. This is the so called configuration model. For more details about this 
model, and random regular graphs in general we refer the interested reader to two excellent 
monographs mi and m, or to a survey |81-{j . As it turns out, sparse random regular graphs 
have quite different properties from those of the binomial random graph G{n,p),p = d/n. For 
example, they are almost surely connected. The spectrum of G^^d for a fixed d was studied in 
m by Friedman, Kahn and Szemeredi. Friedman m proved that for constant d the second 
largest eigenvalue of a random d-regular graph is A = (1 -|- o{l)) 2 ^/d — 1. The approach of 
Kahn and Szemeredi gives only 0{Vd) bound on A but continues to work also when d is 
small power of n. The case d !§> was recently studied by Krivelevich, Sudakov, Vu and 
Wormald m- They proved that in this case for any two vertices u,v € Gn,d almost surely 

\codeg{u,v) — d‘^/n\ < GS/r? -\r ^d'JXognj\fn^ 

where G is some constant and codeg{u, v) is the number of common neighbors of u, v. More¬ 
over if d > re/log n, then C can be defined to be zero. Using this it is easy to show that for 
d S> re^/^, the second largest eigenvalue of a random d-regular graph is o{d). The true bound 
for the second largest eigenvalue of Gn,d should be probably (1 -|- o(l))2 \/d — 1 for all values 
of d, but we are still far from proving it. 

Strongly regular graphs 

3. Let g = be a prime power which is congruent to 1 modulo 4 so that —1 is a square in 
the finite field GF{q). Let Pq be the graph whose vertices are all elements of GF[q) and two 
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vertices are adjacent if and only if their difference is a quadratic residue in GF{q). This graph 
is usually called the Paley graph. It is easy to see that Pq is {q — l)/2-regular. In addition 
one can easily compute the number of common neighbors of two vertices in Pq. Let y be the 
quadratic residue charaeter on GF{q), i.e., x(0) = 0, xi^) = 1 if x 7 ^ 0 and is a square in 
GF{q) and x(x) = —1 otherwise. By definition, x(x) = 0 and the number of common 
neighbors of two vertices a and b equals 



1 + x(q 
2 



1 + x(fe 



q-2 xia-b) 


+ x)x(b - x). 


x^a,b 


Using that for x ^ b, x{b — x) 


x[(b — x) the last term can be rewritten as 


X(« - x)x[{b -x) ^)=Y. = -!• 

x^a^b x^a^b x^a,b x^0,l 


Thus the number of common neighbors of a and 6 is {q — 3)/4 — x(o ~ b)/2. This equals 
(g — 5)/4 if a and b are adjacent and {q — l)/4 otherwise. This implies that the Paley graph 
is a strongly regular graph with parameters [q, {q — l)/2, [q — 5)/4, {q — l)/4) and therefore 
its second largest eigenvalue equals {^/q + l)/ 2 . 

4. For any odd integer k let Hk denote the graph whose Uk = 2^~^ — 1 vertices are all binary 
vectors of length k with an odd number of ones except the all one vector, in which two 
distinct vertices are adjacent iff the inner product of the corresponding vectors is 1 modulo 2 . 
Using elementary linear algebra it is easy to check that this graph is ( 2 ^“^ — 2 )-regular. Also 
every two nonadjacent vertices vertices in it have 2 ^~^ — 1 common neighbors and every two 
adjacent vertices vertices have 2^~^ — 3 common neighbors. Thus is a strongly regular 
graph with parameters ( 2 ^“^ — 1 , 2 *^“^ — 2 , 2 ^“^ — 3,2*^“^ — l) and with the second largest 

k—3 

eigenvalue X{Flk) = 1 + 2 ^~. 


5. Let g be a prime power an let V{G) be the elements of the two dimensional vector space over 
GF{q), so G has q'^ vertices. Partition the q + 1 lines through the origin of the space into 
two sets P and N, where |P| = k. Two vertices x and y of the graph G are adjacent if x — y 
is parallel to a line in P. This example is due to Delsarte and Goethals and to Turyn (see 
m- It is easy to check that G is strongly regular with parameters {^k{q — 1), {k — l){k — 
2) + q — 2, k{k — 1)). Therefore its eigenvalues, besides the trivial one are —k and q — k. Thus 
if k is sufficiently large we obtain that G \s d = k{q — l)-regular graph whose second largest 
eigenvalue is much smaller than d. 

Graphs arising from finite geometries 


6 . For any integer t >2 and for any power g = of prime p let PG{q,t) denote the projective 
geometry of dimension t over the finite field GF{q). The interesting case for our purposes here 
is that of large q and fixed t. The vertices of PG{q, t) correspond to the equivalence classes of 
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the set of all non-zero vectors x = {xq, ..., Xt) of length t+\ over GF{q), where two vectors are 
equivalent if one is a multiple of the other by an element of the field. Let G denote the graph 
whose vertices are the points of PG{q, t) and two (not necessarily distinct) vertices x and y are 
adjacent if and only if xqUo xtyt = 0. This construction is well known. In particular, in 

case t = 2 this graph is often called the Erdos-Renyi graph and it contains no cycles of length 
4. It is easy to see that the number of vertices of G is Uq^t = “ l)/ (o' “ l) = (l + o(l)) q^ 

and that it is dg^t-regular for dq^t = “ 1)/(9 “ l) = (l + where o(l) tends to 

zero as q tends to infinity. It is easy to see that the number of vertices of G with loops is 
bounded by 2 (g* — l)/(g — l) = (2 + o{l))q^~^, since for every possible value of xq, ..., xt-i 
we have at most two possible choices of xt- Actually using more complicated computation, 
which we omit, one can determine the exact number of vertices with loops. The eigenvalues 
of G are easy to compute (see El)- Indeed, let A be the adjacency matrix of G. Then, by 
the properties of PG{q, t), A'^ = AA^ = /i J -|- {dq^t — /u)/, where y. = {q^~^ — l)/(g — l), J is 
the all one matrix and I is the identity matrix, both of size Uq^t x Therefore the largest 
eigenvalue of A is dq^t and the absolute value of all other eigenvalues is dq^t — d- = 

7. The generalized polygons are incidence structures consisting of points V and lines C. For 
our purposes we restrict our attention to those in which every point is incident to (7 -|- 1 lines 
and every line is incident to g -|- 1 points. A generalized m-gon defines a bipartite graph G 
with bipartition ("P, C) that satisfies the following conditions. The diameter of G is m and 
for every vertex v € G there is a vertex u € G such that the shortest path from u to v has 
length m. Also for every r < m and for every two vertices u, v at distance r there exists a 
unique path of length r connecting them. This immediately implies that every cycle in G has 
length at least 2m. For g > 2, it was proved by Feit and Higman IMI that {q + I)-regular 
generalized m-gons exist only for m = 3,4, 6 . A polarity of G is a bijection 7 r:Pu£—>PUG 
such that 7 r(P) = C, 7 r(£) = P and is the identity map. Also for every p G P, f G £, 7 r(p) 
is adjacent to tt{1) if and only if p and I are adjacent. Given vr we define a polarity graph G’^ 
to be the graph whose vertices are point in P and two (not necessarily distinct) points pi,P 2 
are adjacent iff pi was adjacent to 7 r(p 2 ) in G. Some properties of G^ can be easily deduced 
from the corresponding properties of G. In particular, G^ is {q + l)-regular and also contains 
no even cycles of length less than 2 m. 

For every q which is an odd power of 2, the incidence graph of the generalized 4-gon has a 
polarity. The corresponding polarity graph is a ((7 -|- l)-regular graph with q^ + q'^ + q + 1 
vertices. See m, m for more details. This graph contains no cycle of length 6 and it is not 
difficult to compute its eigenvalues (they can be derived, for example, from the eigenvalues 
of the corresponding bipartite incidence graph, given in ESI). Indeed, all the eigenvalues, 
besides the trivial one (which is g -|- 1) are either 0 or ^/^ or — \/2q. Similarly, for every q 
which is an odd power of 3, the incidence graph of the generalized 6 -gon has a polarity. The 
corresponding polarity graph is a {q + l)-regular graph with q^ + q'^ + ■ ■ ■ + q + 1 vertices ( see 
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again m, m)- This graph contains no cycle of length 10 and its eigenvalues can be derived 
using the same technique as in case of the 4-gon. All these eigenvalues, besides the trivial 
one are either \/3g or — \/3q or ^ or —y/q. 

Cayley graphs 

8. Let G be a finite group and let S' be a set of non-identity elements of G such that S = S“^, 
i.e., for every s £ S, s~^ also belongs to S. The Cayley graph T{G,S) of this group with 
respect to the generating set S is the graph whose set of vertices is G and where two vertices g 
and g' are adjacent if and only if g'g~^ G S. Clearly, r(G, S) is |S|-regular and it is connected 
iff S is a set of generators of the group. If G is abelian then the eigenvalues of the Cayley 
graph can be computed in terms of the characters of G. Indeed, let y : G —> G be a character 
of G and let A be the adjacency matrix of r(G, S) whose rows and columns are indexed by 
the elements of G. Consider the vector v defined by ^{g) = xid)- Then it is easy to check 
that Av = av with a = X]sesA('S)- addition all eigenvalues can be obtained in this way, 
since every abelian group has exactly |G| different character which are orthogonal to each 
other. Using this fact, one can often give estimates on the eigenvalues of r(G, S) for abelian 
groups. 

One example of a Cayley graph that has already been described earlier is Pq. In that case 
the group is the additive group of the finite field GF{q) and S is the set of all quadratic 
residues modulo q. Next we present a slightly more general construction. Let q = 2kr -|- 1 be 
a prime power and let T be a Cayley graph whose group is the additive group of GF{q) and 
whose generating set is 5* = {x = | for some y G GF(g)}. By definition, T is {q — l)/k- 

regular. On the other hand, this graph is not strongly regular unless k = 2, when it is the 
Paley graph. Let y be a nontrivial additive character of GF{q) and consider the Gauss sum 
J2yeGF{q) x{y^)- Using the classical bound | Y.yeGF{q) x{y^)\ < {k - l)gU2 (gee e.g. (03]) and 
the above connection between characters and eigenvalues we can conclude that the second 
largest eigenvalue of our graph T is bounded by 

9. Next we present a surprising construction obtained by Alon j3| of a very dense pseudo-random 

graph that on the other hand is triangle-free. For a positive integer k, consider the finite field 
GF{2^), whose elements are represented by binary vectors of length k. If a,b,c are three 
such vectors, denote by (a, b, c) the binary vector of length 3k whose coordinates are those 
of a, followed by coordinates of b and then c. Suppose that k is not divisible by 3. Let 
Wo be the set of all nonzero elements a G GF{2^) so that the leftmost bit in the binary 
representation of is 0, and let Wi be the set of all nonzero elements a G GF{2^) for which 

the leftmost bit of is 1. Since 3 does not divide k, 7 does not divide 2 ^ — 1 and hence 
I Wo I = 2 ^~^ — 1 and |Wi| = 2^“^, as when a ranges over all nonzero elements of the field so 
does Let Gn be the graph whose vertices are all n = 2^^ binary vectors of length 3k, where 
two vectors v and v' are adjacent if and only if there exist wq G Wq and wi G Wi so that 
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V — v' = (tijq, 'ii’o’ ^o) + (^ 1 ) )) where here powers are computed in the field GF{2^) and 

the addition is addition modulo 2. Note that Gn is the Cayley graph of the additive group 
with respect to the generating set 5 = [/q + Ci, where Uq = {(tco, icq, tCg) | wq e Wq} 
and Ui is defined similarly. A well known fact from Coding Theory (see e.g., inni), which 
can be proved using the Vandermonde determinant, is that every set of six distinct vectors 
in UqL) Ui is linearly independent over GF{2). In particular all the vectors in Uq + Ui are 
distinct, S = |C/o||17i| and hence Gn is |5| = — l)-regular. The statement that 

Gn is triangle free is clearly equivalent to the fact that the sum modulo 2 of any set of 3 
nonzero elements of S is not a zero-vector. Let uq + ui,u'q + u'^ and Mq -|- u” be three distinct 
element of S, where uo,Uq,Uq G Uq and G Ui. By the above discussion, if the 

sum of these six vectors is zero, then every vector must appear an even number of times 
in the sequence {uo,Uq,Uq,ui,u'i,u'1). However, since Uq and Ui are disjoint, this is clearly 
impossible. Finally, as we already mentioned, the eigenvalues of Gn can be computed in 
terms of characters of Using this fact together with the Carlitz-Uchiyama bound on 

the characters of Z^^ it was proved in that the second eigenvalue of Gn is bounded by 
A < 9 • 2*^3 • 2^/2-h 1/4. 

10. The construction above can be extended in the obvious way as mentioned in jlOj . Let h >1 
and suppose that k is an integer such that 2^ — 1 is not divisible by Ah + 3. Let Wq be the set 
of all nonzero elements a G GF{2^) so that the leftmost bit in the binary representation of 
a^h+3 jg gg j-j^g gg^ q£ gjj nonzero elements a G GF{2^) for which the leftmost 

bit of is 1. Since Ah + 3 does not divide 2*^ — 1 we have that |lFo| = 2^“^ — 1 and 

|lUi| = 2^~^, as when a ranges over all nonzero elements of the field so does Define 

G to be the Cayley graph of the additive group with respect to the generating set 

S = Uq + Ui, where Uq = {{wo,Wq, ... \ wq G Wq] and Ui is defined similarly. 

Clearly, G is a 2^“^(2^“^ — l)-regular graph on 2 ( 2 ^+!)^ vertices. Using methods from |^, one 
can show that G contains no odd cycle of length < 2h + 1 and that the second eigenvalue of 
G is bounded by 0{2^). 


11. Now we describe the celebrated expander graphs constructed by Lubotzky, Phillips and Sar- 
nak m and independently by Margulis IMl. Let p and q be unequal primes, both congruent 
to 1 modulo 4 and such that p is a quadratic residue modulo q. As usual denote by PSL(2, q) 
the factor group of the group of two by two matrices over GF(q) with determinant 1 modulo 


its normal subgroup consisting of the two scalar matrices 


1 0 
0 1 


and 


-1 

0 


0 

-1 


. The 


graphs we describe are Cayley graphs of PSL{2,q). A well known theorem of Jacobi asserts 
that the number of ways to represent a positive integer re as a sum of 4 squares is 8 Yl 4 ji'd d\n 
This easily implies that there are precisely p -|- 1 vectors a = ( 00 , 01 , 02 , 03 ), where oq is an 
odd positive integer, oi, 02 , 03 are even integers and Qq + of + = p. Prom each such 
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ao + ioi 0,2 + icL^ 


and 


vector construct the matrix Ma in PSL{2,q) where Ma = ( 

V -0,2 + ios ao - ioi 

1 is an integer satisfying = —l(mod q). Note that, indeed, the determinant of Ma is 1 
and that the square root of p modulo q does exist. Let G^’’^ denote the Cayley graph of 
PSL{2,q) with respect to these p + 1 matrices. In [^21 it was proved that if g > 2y^ then 

is a connected {p + l)-regular graph on n = q{q‘^ — l)/2 vertices. Its girth is at least 

2 logp q and all the eigenvalues of its adjacency matrix, besides the trivial one Ai = p + 1, are 
at most 2y^ in absolute value. The bound on the eigenvalues was obtained by applying deep 
results of Eichler and Igusa concerning the Ramanujan conjecture. The graphs have very 
good expansion properties and have numerous applications in Combinatorics and Theoretical 
Computer Science. 


12. The projective norm graphs NGp^t have been constructed in HZl, modifying an earlier con¬ 
struction given in ISD' These graphs are not Cayley graphs, but as one will immediately 
see, their construction has a similar flavor. The construction is the following. Let t > 2 
be an integer, let p be a prime, let GF{p)* be the multiplicative group of the field with p 
elements and let GF{p^~^) be the field with p^~^ elements. The set of vertices of the graph 
NGp^t is the set V = GF{p*'~^) x GF{p)*. Two distinct vertices (X, a) and (T, 6) G V are 
adjacent if and only if N(X + Y)= ab, where the norm N is understood over GF{p), that 
is, N{X) = X^M^ Hp* Note that |R| = p* — p*“^. If (X, a) and iY,b) are adjacent, then 
{X,a) and Y ^ —X determine b. Thus NGp^t is a regular graph of degree p*“^ — 1. In 
addition, it was proved in HZl, that NGp^t contains no complete bipartite graphs 
These graphs can be also defined in the same manner starting with a prime power instead 
of the prime p. It is also not difficult to compute the eigenvalues of this graph. Indeed, 
put q = p^~^ and let A be the adjacency matrix of NGp^f The rows and columns of this 
matrix are indexed by the ordered pairs of the set GF{q) x GF{p)*. Let ip he a character of 
the additive group of GF{q), and let y be a character of the multiplicative group of GF{p). 
Consider the vector v : GF{q) x GF{p)* i—> G defined by v(X, a) = 'tp{X)x{a). Now one 
can check (see d, m for more details) that the vector v is an eigenvector of with 
eigenvalue | '^zeGF{q) V’(-^)x(-^(-2^))|^ and that all eigenvalues of have this form. Set 
x'iZ) = x(N(Z)) for all nonzero Z in GF{q). Note that as the norm is multiplicative, x' is 
a multiplicative character of the large field. Hence the above expression is a square of the 
absolute value of the Gauss sum and it is well known (see e.g. EH, EOl) that the value of 
each such square, besides the trivial one (that is, when either ip or x' are trivial), is q. This 
implies that the second largest eigenvalue of NGpg is 
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4 Properties of pseudo-random graphs 


We now examine closely properties of pseudo-random graphs, with a special emphasis on (n, d, X)- 
graphs. The majority of them are obtained using the estimate © of Theorem 12.Ill showing again 
the extreme importance and applicability of the latter result. It is instructive to compare the 
properties of pseudo-random graphs, considered below, with the analogous properties of random 
graphs, usually shown to hold by completely different methods. The set of properties we chose to 
treat here is not meant to be comprehensive or systematic, but quite a few rather diverse graph 
parameters will be covered. 


4.1 Connectivity and perfect matchings 

The vertex-connectivity of a graph G is the minimum number of vertices that we need to delete to 
make G disconnected. We denote this parameter by k{G). For random graphs it is well known (see, 
e.g., 120]) that the vertex-connectivity is almost surely the same as the minimum degree. Recently 
it was also proved (see inn and M) that random d-regular graphs are d-vertex-connected. For 
(n, d, A)-graphs it is easy to show the following. 

Theorem 4.1 Let G be an (n, d, X)-graph with d < nj^. Then the vertex-connectivity ofG satisfies: 

k{G) >d- 36AVrf- 


Proof. We can assume that A < d/6, since otherwise there is nothing to prove. Suppose that 
there is a subset S' C R of size less than d — 36A^/d such that the induced graph G[R — S] is 
disconnected. Denote by U the set of vertices of the smallest connected component of G\y — 5] 
and set W = R — (S U U). Then \W\ > (n — d)/2 > n/4 and there is no edge between U 
and W. Also |[/| -|- IS"! > d, since all the neighbors of a vertex from U are contained in S' U C/. 
Therefore \U\ > 36X'^/d. Since there are no edges between U and W, by Theorem mm we have 
that d|C/||IF|/n < Ay^|[/||IT|. This implies that 


A^n^ 

d2|W| 


\U\< 


X n Xn 1 An An 

dW\~d - 6 T ^ T' 


Next note that, by Theorem Em the number of edges spanned by U is at most 


e{U)< 


d\u? m\ 

2n 2 


Xnd\U\ X\U\ 


m I m 
2^2 


X\U\. 


As the degree of every vertex in U is d, it follows that 


e{U, S) >d\U\- 2e{U) > (d - 2X)\U\ > 2d\U\/3. 
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On the other hand using again Theorem 12.111 together with the facts that \U\ > 36A^/d, l^l < d 
and d < n/2 we conclude that 

^ d\U\ XVd\U\ d\U\ d\U\ _ 2d\U\ 

~ ^ QX/Vd 3 

This contradiction completes the proof. □ 

The constants in this theorem can be easily improved and we make no attempt to optimize them. 
Note that, in particular, for an (n, d, A)-graph G with A = 0{Vd) we have that k{G) = d — 0(1). 

Next we present an example which shows that the assertion of Theorem 14.11 is tight up to a 
constant factor. Let G be any (n, d, A)-graph with A = 0(\/d). We already constructed several 
such graphs in the previous section. For an integer k, consider a new graph Gk, which is obtained 
by replacing each vertex of G by the complete graph of order k and by connecting two vertices of 
Gfc by an edge if and only if the corresponding vertices of G are connected by an edge. Then it 
follows immediately from the definition that Gk has n' = nk vertices and is dh^egular graph with 
d' = dk + k — 1. Let X' be the second eigenvalue of Gk- To estimate X' note that the adjacency 
matrix of Gk equals to Aq < 8 ) Tfc + In ® Ak^.- Here Aq is the adjacency matrix of G, Jk is the all 
one matrix of size k x k, In is the identity matrix of size n x n and Ak,. is the adjacency matrix of 
the complete graph of order k. Also the tensor product of the mxn dimensional matrix A = {aij) 
and the s x t-dimensional matrix B = {bki) is the ms x nt-dimensional matrix A® B, whose entry 
labeled ((i, k){j, 1)) is aijbki- In case A and B are symmetric matrices with spectrums {Ai,..., A^}, 
{Hi,..., iJ-t} respectively, it is a simple consequence of the definition that the spectrum of A® B is 
{Xifik ■ i = I, ■ ■ ■ ,n, k = 1 ,... ,t} (see, e.g. [Ol])- Therefore the second eigenvalue of Aq ® Jk is kX. 
On the other hand In®AKf, is the adjacency matrix of the disjoint union of fc-cliques and therefore 
the absolute value of all its eigenvalues is at most k — 1. Using these two facts we conclude that 
X' < Xk + k — 1 and that Gk is {n' = nk, d' = dk + k — 1, X' = Xk + k — l)-graph. Also it is easy 
to see that the set of vertices of Gk that corresponds to a vertex in G has exactly dk neighbors 
outside this set. By deleting these neighbors we can disconnect the graph Gk and thus 

K{Gk) <dk = d' -ik-l) = d' - n{{X'f/d'). 

Sometimes we can improve the result of Theorem 14.11 using the information about co-degrees 
of vertices in our graph. Such result was used in ina to determine the vertex-connectivity of dense 
random d-regular graphs. 

Proposition 4.2 m Let G = {V,E) be a d-regular graph on n vertices such that ^/nlogn < d< 
3n/4: and the number of common neighbors for every two distinct vertices in G is {\ -\- o{l))d?/n. 
Then the graph G is d-vertex-connected. 
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Similarly to vertex-connectivity, define the edge-connectivity of a graph G to be the minimum 
number of edges that we need to delete to make G disconnected. We denote this parameter by 
k!{G). Clearly the edge-connectivity is always at most the minimum degree of a graph. We also 
say that G has a perfect matching if there is a set of disjoint edges that covers all the vertices of G. 
Next we show that (n, d, A)-graphs even with a very weak spectral gap are d-edge-connected and 
have a perfect matching (if the number of vertices is even). 

Theorem 4.3 Let G be an (n,d, X)-graph with d — X > 2. Then G is d-edge-connected. When n 
is even, it has a perfect matching. 


Proof. Let C/ be a subset of vertices of G of size at most n/2. To prove that G is d-edge- 
connected we need to show that there are always at least d edges between U and V{G) — U. If 
1 < |t^| < d, then every vertex in U has at least d — (|17| — 1) neighbors outside U and therefore 
e{U,V{G) — U) > \U\{d — \U\ -|- l) > d. On the other hand if d < \U\ < n/2, then using that 
d — A > 2 together with Theorem 12.111 we obtain that 


e{U,V{G)-U) 


> 


d\U\{n-\U\) 

n 


Xj\U\in-\U\){l 


(i 

n J \ n J 


n 2 


and therefore k'{G) = d. 

To show that G contains a perfect matching we apply the celebrated Tutte’s condition. Since 
n is even, we need to prove that for every nonempty set of vertices S, the induced graph G\y — S] 
has at most |5| connected components of odd size. Since G is d-edge-connected we have that there 
are at least d edges from every connected component of G[V — 5] to S. On the other hand there 
are at most d|5| edges incident with vertices in S. Therefore G/V — S] has at most liSI connected 
components and hence G contains a perfect matching. □ 


4.2 Maximum cut 

Let G = {V, E) be a graph and let S' be a nonempty proper subset of V. Denote by (S, V — S) the 
cut of G consisting of all edges with one end in S and another one in 1/ — S. The size of the cut 
is the number of edges in it. The MAX CUT problem is the problem of finding a cut of maximum 
size in G. Let f(G} be the size of the maximum cut in G. MAX CUT is one of the most natural 
combinatorial optimization problems. It is well known that this problem is NP-hard ES- Therefore 
it is useful to have bounds on f(G} based on other parameters of the graph, that can be computed 
efficiently. 

Here we describe two such folklore results. First, consider a random partition V = Ui U V 2 , 
obtained by assigning each vertex u G U to Vi or V 2 with probability 1/2 independently. It is easy 
to see that each edge of G has probability 1/2 to cross between Vi and V 2 - Therefore the expected 
number of edges in the cut (Vi, V 2 ) is m/2, where m is the number of edges in G. This implies that 
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for every graph f{G) > m/2. The example of a complete graph shows that this lower bound is 
asymptotically optimal. The second result provides an upper bound for f{G), for a regular graph 
G, in terms of the smallest eigenvalue of its adjacency matrix. 

Proposition 4.4 Let G be a d-regular graph (which may have loops) of order n with m = dn/2 
edges and let \i > X 2 >■■■> Xn be the eigenvalues of the adjacency matrix of G. Then 

na) 

In particular if G is an {n,d, X)-graph then f{G) < (d + A)n/4. 

Proof. Let A = (ajj) be the adjacency matrix of G = {V,E) and let V = re}. Let 

X = (xi,..., Xn) be any vector with coordinates ±1. Since the graph G is d-regular we have 

n 

{xi — Xj)'^ = d xj — QijXiXj = dn — y/'Ax.. 

(i,j)£E i=l i,j 

By the variational definition of the eigenvalues of A, for any vector 2 ; £ z^Az > A^H^lp. 
Therefore 

{xi — Xj)"^ = dn — x^Ax < dn — A,i||x|p = dn — Xnn. (11) 

{hj)eE 

Let V = Li U V 2 be an arbitrary partition of V into two disjoint subsets and let e(Li, V 2 ) be the 
number of edges in the bipartite subgraph of G with bipartition (Pi, V 2 ). For every vertex v £ V (G) 
define Xy = 1 \i v £ V\ and Xy = —1 ii v £ ¥ 2 - Note that for every edge (i, j) of G, (x, — Xj)^ = 4 if 
this edge has its ends in the distinct parts of the above partition and is zero otherwise. Now using 
(HU), we conclude that 

e(Pi,P 2 ) = ^ ^ (xi-Xj)^ < i(dre-Anre) = y □ 

This upper bound is often used to show that some particular results about maximum cuts are 
tight. For example this approach was used in [S] and |H1- In these papers the authors proved that 
for every graph G with rre edges and girth at least r > 4, /(G) > m/2-|-n(m'-+i). They also show, 
using Proposition 14.41 and Examples 9, 6 from Section 3, that this bound is tight for r = 4, 5. 


4.3 Independent sets and the chromatic number 

The independence number a{G) of a graph G is the maximum cardinality of a set of vertices of G 
no two of which are adjacent. Using Theorem 12. Ill we can immediately establish an upper bound 
on the size of a maximum independent set of pseudo-random graphs. 


Proposition 4.5 Let G be an {n,d, X)-graph, then 


a{G) < 


Xn 

d + X' 
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Proof. Let U be an independent set in G, then e{U) = 0 and by Theorem 12.111 we have that 
(i|f7p/n < A|f7|(l — |C/|/n). This implies that |f7| < An/((i + A). □ 

Note that even when A = 0(\/d) this bound only has order of magnitude 0{n/\^). This contrasts 
sharply with the behavior of random graphs where it is known (see m and m) that the indepen¬ 
dence number of random graph G{n,p) is only 0(^ logd) where d = (1 + o{l))np. More strikingly 
there are graphs for which the bound in Proposition 14.51 cannot be improved. One such graph is 
the Paley graph Pq with q = p^ (Example 3 in the previous section). Indeed it is easy to see that 
in this case all elements of the subheld GF{p) C GF{p^) are quadratic residues in GF{p^). This 
implies that for every quadratic non-residue (3 G GF{p^) all elements of any multiplicative coset 
f3GF[p) form an independent set of size p. As we already mentioned, Pq is an (n, d, A)-graph with 
n = d = (jP' — l)/2 and A = (p -|- l)/2. Hence for this graph we get a{Pq) = An/(d -|- A). 

Next we obtain a lower bound on the independence number of pseudo-random graphs. We 
present a slightly more general result by Alon et al. m which we will need later. 


Proposition 4.6 Let G be an {n,d, X)-graph such that A < d < 0.9n. Then the induced 
subgraph G[U] of G on any subset U,\U\ = m, contains an independent set of size at least 


a{G[U]) > 


n ^ f m{d — A) ^ 

2(d - A) Vra(A + l) ^ ) ■ 


In particular, 


a{G) > 


n 

2(d - A) 


( jd-X) 

V(A + 1) 



Sketch of proof. First using Theorem 12.Ill it is easy to show that if ?7 is a set of bn vertices of 
G, then the minimum degree in the induced subgraph G[U] is at most db + A(1 — 6) = (d — X)b + 
A. Construct an independent set I in the induced subgraph G[U] of G by the following greedy 
procedure. Repeatedly choose a vertex of minimum degree in G[U] , add it to the independent set 
I and delete it and its neighbors from U, stopping when the remaining set of vertices is empty. Let 
ai,i > 0 he the sequence of numbers dehned by the following recurrence formula; 


ao = m, ai+i = a, - (d— A(1 - —) -M) = ( 1 - 

n n 


d-X 


n 


a-i — {X + 1), Vi > 0. 


By the above discussion, it is easy to see that the size of the remaining set of vertices after i 
iterations is at least o*. Therefore the size of the resulting independent set I is at least the smallest 
index i such that Oj < 0. By solving the recurrence equation we obtain that this index satishes: 


i > 


n 

2(d - A) 


/ m(d — A) 
V n(A 1) 



□ 


For an (n, d, A)-graph G with A < d^“^, <5 > 0, this proposition implies that a{G) > log d). This 
shows that the independence number of a pseudo-random graph with a sufficiently small second 
eigenvalue is up to a constant factor at least as large as a{G{n,p)) with p = d/n. On the other hand 
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the graph (Example 4, Section 3) shows that even when A < 0{-s/d) the independence number of 
(n, d, A)-graph can be smaller than a{G{n,p)) with p = d/n. This graph has n = — 1 vertices, 

degree d = {1 + o{l))n/2 and A = 0(\/d). Also it is easy to see that every independent set in Hk 
corresponds to a family of orthogonal vectors in Z 2 and thus has size at most k = {1 + o(l)) log 2 n. 
This is only half of the size of a maximum independent set in the corresponding random graph 
G(n, 1/2). 

A vertex-coloring of a graph G is an assignment of a color to each of its vertices. The coloring 
is proper if no two adjacent vertices get the same color. The chromatic number xiG) of G is the 
minimum number of colors used in a proper coloring of it. Since every color class in the proper 
coloring of G forms an independent set we can immediately obtain that xiG) > \V{G)\/a{G). This 
together with Proposition 14.51 implies the following result of Hoffman j48j . 

Corollary 4.7 Let G he an {n,d, X)-graph. Then the chromatic number of G is at least 1 + d/X. 

On the other hand, using Proposition 14.61 one can obtain the following upper bound on the 
chromatic number of pseudo-random graphs. 


Theorem 4.8 m Let G be an (n, d, X)-graph such that X < d < 0.9re. Then the chromatic number 
of G satisfies 


X{G)< 


6{d — A) 
ln(^ + l) ■ 


Sketch of proof. Color the graph G as follows. As long as the remaining set of vertices U contains 
at least n/ln(^^ + 1) vertices, by Proposition 14.61 we can find an independent set of vertices in 
the induced subgraph G[U] of size at least 


n ( \U\{d-X) A 

2{d-X) \n{X + l) ) 


> 


n 


4(d - A) 


In 


fd-X 

yx + i 



Color all the members of such a set by a new color, delete them from the graph and continue. When 
this process terminates, the remaining set of vertices Lf is of size at most n/ln(^^ + 1) and we 
used at most 4:{d — A)/ln(^^ + 1) colors so far. As we already mentioned above, for every subset 
U' CU the induced subgraph G[U'] contains a vertex of degree at most 


\U' 


\U\ 


(d - A)J—^ + A < (d - A)^ + A < 


d- A 


n 


n 


ln( 


A+i ^ 


+ X< 


2(d-A) 
— + If 

A+l ^ 


ln( 


- 1 . 


Thus we can complete the coloring of G by coloring G[U] using at most 2(d — A)/ln(^^ + 1) 
additional colors. The total number of colors used is at most 6(d — A)/ln(^^ + 1). □ 

For an (n, d, A)-graph G with A < d^~^,5 > 0 this proposition implies that x(G) < 

This shows that the chromatic number of a pseudo-random graph with a sufficiently small second 
eigenvalue is up to a constant factor at least as small as xiG{n,p)) withp = d/n. On the other hand, 
the Paley graph Pq,q = p^, shows that sometimes the chromatic number of a pseudo-random graph 


28 









can be much smaller than the above bound, even in the case A = Q{^/d). Indeed, as we already 
mentioned above, all elements of the subfield GF{p) C GFijP') are quadratic residues in GF{p^). 
This implies that for every quadratic non-residue (5 G GF{p^) all elements of a multiplicative coset 
(5GF{p) form an independent set of size p. Also all additive cosets of (5GF{p) are independent sets 
in Pq. This implies that x{Pq) < \/9 = V- fact Pq contains a clique of size p (all elements of a 
subfield GF{p)), showing that x{Pq) = i/? ^/logg. Therefore the bound in Corollary 14.71 is best 
possible. 

A more complicated quantity related to the chromatic number is the list-chromatic number xi{G) 
of G, introduced in |M| and m This is the minimum integer k such that for every assignment of a 
set S{v) of k colors to every vertex v of G, there is a proper coloring of G that assigns to each vertex 
V a color from S{v). The study of this parameter received a considerable amount of attention in 
recent years, see, e.g., 12 ], inzj for two surveys. Note that from the definition it follows immediately 
that Xi{G) P x{G) s-ncl it is known that the gap between these two parameters can be arbitrarily 
large. The list-chromatic number of pseudo-random graphs was studied by Alon, Krivelevich and 
Sudakov ^2] and independently by Vu |H1] . In ^2] and |H1] the authors mainly considered graphs 
with all degrees (1 -|- o(l))np and all co-degrees (1 -|- o{l))np^. Here we use ideas from these two 
papers to obtain an upper bound on the list-chromatic number of an (re, d, A)-graphs. This bound 
has the same order of magnitude as the list chromatic number of the truly random graph G{n,p) 
with p = d/n (for more details see ^2], IH1|)- 

Theorem 4.9 Suppose that 0 < d < 1 and let G be an (re, d, \)-graph satisfying A < d^~^, d < 0.9re. 
Then the list-chromatic number of G is bounded by 

^ c (■ 

Proof. Suppose that d is sufficiently large and consider first the case when d < Then by 

Theorem 12 .11 1 the neighbors of every vertex in G span at most d^/n -\- Xd < 0{d?~^P) edges. Now 
we can apply the result of Vu IHlj which says that if the neighbors of every vertex in a graph G 
with maximum degree d span at most 0{d^~^P) edges then Xi{G) < O[d/{6 log d)). 

Now consider the case when d > . For every vertex u G V, let S{v) be a list of at least 

colors. Our objective is to prove that there is a proper coloring of G assigning to each vertex 
a color from its list. As long as there is a set G of at least vertices containing the same color 

c in their lists we can, by Proposition 14.hi find an independent set of at least log re vertices in C, 
color them all by c, omit them from the graph and omit the color c from all lists. The total number 
of colors that can be deleted in this process cannot exceed (since in each such deletion at 

least ^ log re vertices are deleted from the graph). When this process terminates, no color appears 
in more than lists, and each list still contains at least colors. Therefore, by 

Hall’s theorem, we can assign to each of the remaining vertices a color from its list so that no color 
is being assigned to more than one vertex, thus completing the coloring and the proof. □ 
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4.4 Small subgraphs 


We now examine small subgraphs of pseudo-random graphs. Let iL be a fixed graph of order s 
with r edges and with automorphism group Aut{H). Using the second moment method it is not 
difficult to show that for every constant p the random graph G{n,p) contains 


(1 + 0(1))/(1-P)(2)-" 


\Aut{H)\ 


induced copies of H. Thomason extended this result to jumbled graphs. He showed in izni that if 
a graph G is {p, a)-jumbled and p^n ^ 42as^ then the number of induced subgraphs of G which 
are isomorphic to H is {1 + o(l))p®(l — /\Aut{H)\. 

Here we present a result of Noga Alon |Sj that proves that every large subset of the set of 
vertices of (n, d, A)-graph contains the ’’correct” number of copies of any fixed sparse graph. An 
additional advantage of this result is that its assertion depends not on the number of vertices s in 
H but only on its maximum degree A which can be smaller than s. Special cases of this result have 
appeared in various papers including m, CHI and probably other papers as well. The approach 
here is similar to the one in m- 


Theorem 4.10 m Let H be a fixed graph with r edges, s vertices and maximum degree A, and let 
G = {V,E) be an {n,d, X)-graph, where, say, d < 0.9n. Let m <n satisfy m 3> A(^)^. Then, for 
every subset V of cardinality m, the number of (not necessarily induced) copies of H in V' is 

Note that this implies that a similar result holds for the number of induced copies of H. Indeed, if 
d and m S> then the number of copies of each graph obtained from H by adding to 

it at least one edge is, by the above Theorem, negligible compared to the number of copies of H, 
and hence almost all copies of H in V' are induced. If d = 0(n) then, by inclusion-exclusion, the 
number of induced copies of H in V' as above is also roughly the ’’correct” number. A special case 
of the above theorem implies that if A = 0{Vd) and d 3> then any (n, d, A)-graph contains 
many triangles. As shown in Example 9, Section 3, this is not true when d = + o(l))n^/^, 

showing that the assertion of the theorem is not far from being best possible. 

Proof of Theorem 14.101 To prove the theorem, consider a random one-to-one mapping of the 
set of vertices of H into the set of vertices V. Denote by A{H) the event that every edge of H is 
mapped on an edge of G. In such a case we say that the mapping is an embedding of H. Note that 
it suffices to prove that 

Pr(/l(ff)) = (l+o(l))(^y. (12) 

We prove m by induction on the number of edges r. The base case (r = 0) is trivial. Suppose 
that m holds for all graphs with less than r edges, and let uv be an edge of H. Let Huy be the 
graph obtained from H by removing the edge uv (and keeping all vertices). Let Hy and Hy be the 
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induced subgraphs of H on the sets of vertices V and V (H)\{u}, respectively, and let H' be 

the induced subgraph of H on the set of vertices V{H) \ {rt, v}. Let r' be the number of edges of H' 
and note that r — r' < 2(A —1) + 1 = 2A —1. Clearly Pr{A{Huv)) = Pr{A{Huv)\A{H'))-Pr{A{H')). 
Thus, by the induction hypothesis applied to and to H': 

Pr{A{H^,)\A{H')) = {l + o{l))(^^J ' ' . 


For an embedding f of H\ let y{u, f) be the number of extensions of f to an embedding of 
in V'] denotes the same for v. Clearly, the number of extensions of f to an embedding of 

Huv in V is at least iy{u, f')v{v^ f) — min(i/(u,/'), /')) and at most iy{u, f')v{v^ f). Thus we 

have 

v{u, f)iy{v, f) - minjuju, f), u{v, f)) ^ 

{m — s + 2){m — s + 1) “ ~ {m — s + 2){m — s + 1) 


Taking expectation over all embeddings f the middle term becomes Pr{A{Huv)\A{H')), which 
is (1 + . Note that by our choice of the parameters and the well known fact that 

A = ^l{Vd), the expectation of the term min(z^(u, /'), u{v, /')) ( < m) is negligible and we get 


Ef'{i^{u,f)i^{v,f)\ A{H')) = (l + o(l))m^ 



Now let / be a random one-to-one mapping of V{H) into V' . Let f be a fixed embedding of H'. 
Then 


Prf[A{H)\ f\viH)\{uM = f) = 


n 


{m — s + 2){m — s -|- 1) 


+ 


where |(5| < A • This follows from Theorem Em where we take the possible images 

of u as the set U and the possible images of v as the set W. Averaging over embeddings f we get 
Pr{A{H)\A{H')) on the left hand side. On the right hand side we get (1 -|- o(l))(^)''“^^ from the 
first term plus the expectation of the error term 6. By Jensen’s inequality, the absolute value of 
this expectation is bounded by 


^E{iy{uJ')u{v,f')) 
{m — s + 2){m — s -|- 1) 


\ /^\h-L-i)/2 

{i + o(i))AfA 


m \nj 


Our assumptions on the parameters imply that this is negligible with respect to the main term. 
Therefore Pr{A{H)) = Pr{A{H)\A{H')) ■ Pr{A{H')) = (1 J- o(l)) (^)^, completing the proof of 
Theorem 14.101 □ 

If we are only interested in the existence of one copy of H then one can sometimes improve the 
conditions on d and A in Theorem 14.101 For example if is a complete graph of order r then the 
following result was proved in El- 
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Proposition 4.11 m Let G he an {n,d, X)-graph. Then for every integer r > 2 every set of 
vertices of G of size more than 


(A + l)n 
d 


n 


1 + ^ + ^..+ 



contains a copy of a complete graph Kr- 


In particular, when d > and A < 0{y/d) then any (n, d, A)-graph contains a triangle and as 

shows Example 9 in Section 3 this is tight. Unfortunately we do not know if this bound is also tight 
for r > 4. It would be interesting to construct examples of (n, d, A)-graphs with d = 
and A < 0{Vd) which contain no copy of Kr- 

Finally we present one additional result about the existence of odd cycles in pseudo-random 
graphs. 


Proposition 4.12 Let k > 1 he an integer and let G be an {n, d, \)-graph such that d^^/n 3> A^^ ^ 
Then G contains a cycle of length 2k -\-l. 


Proof. Suppose that G contains no cycle of length 2k -\- 1. For every two vertices tt, u of G denote 
by d{u, v) the length of a shortest path from u to v. For every i > 1 let iVj(u) = {u \ d{u, v) = i} be 
the set of all vertices in G which are at distance exactly i from v. In m Erdos et al. proved that 
if G contains no cycle of length 2k -\-l then for any 1 < i < k the induced graph G[iVj(u)] contains 
an independent set of size \Ni{v)\/{2k — 1). This result together with Pronosition 14.51 imnlies that 
for every vertex v and for every 1 < i < /c, |A^i(u)| < {2k — l)Xn/d. Since d?^/n 3> X?^~^ we have 
that A = o{d). Therefore by Theorem 12.Ill 

e{Ni{v)) < < ‘ikX\Ni{v)\ = o{d\Ni{v)\). 

Next we prove by induction that for every 1 < i < fe, > (1 — o(l))(i^/A^. By the 

above discussion the number of edges spanned by Ni{v) is o{d‘^) and therefore e{Ni{v), N 2 {v)) = 
d? — o{df) = (1 — o(l))d^. On the other hand, by Theorem 12.1 1 1 

e{Ni{v),N2{v)) < -|A^i(u)||A^2(t^)| + A\/|jVi(u)||iV2(t’)| < - d + Xs/d\N2{v)\ 

Tl Tl Cl 

= Xd^j + 0{Xd) = + o{d^). 

Therefore > (1 — o(l))d^/A^. Now assume that > {1 —o{l))d^/X^. Since the number 

of edges spanned by Ni{v) is o(d|iVj(u)|) we obtain 

e{N,{v),N,+i{v)) = d\Ni{v)\-2e{Ni{v)) -e{Ni_i{v),Ni{v)) 

> d\Ni{v)\ - o{d\Ni{v)\) - d\Ni_i{v)\ 

> (1 - o(l))d|lV,(u)| - (1 + o{l))d{Xyd^)mv)\ 

= (1 - o(l))(i|Ni(u)| - o((i|Ni(u)|) = (1 - o(l))(i|iVj(u)|. 
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On the other hand, by Theorem 12.111 

e(iV,(^;),iV.+,(,;)) < -\Niiv)m+iiv)\ + X,/\Niiv)\\Ni+iiv)\ 


n 


< 


z liV.WI + AVIJViWIIJVi+iWI 


n 


= 0(A|iVi(^;)|) + A|iVi(r;)|- 




= xmiy)\^ 


\N^+liv)\ 

\Ni{v)\ 


+ o(d|iVi(?;)|). 


Therefore > (1 “ o(l))d^/A^ and we proved the induction step. 

Finally note that 


k-l 


n 2(1 + "(l))'! ($)'' = (! + °(1»» (“ - 1)T' 

This contradiction completes the proof. 

This result implies that when d ^ n^fc+r and A < 0{\/d) then any (n, d, A)-graph contains a cycle 
of length 2k + 1. As shown by Example 10 of the previous section this result is tight. It is worth 
mentioning here that it follows from the result of Bondy and Simonovits |22j that any d-regular 
graph with d ^ contains a cycle of length 2k. Here we do not need to make any assumption 
about the second eigenvalue A. This bound is known to be tight for /c = 2, 3, 5 (see Examples 6,7, 
Section 3). 


□ 


4.5 Extremal properties 

Turan’s theorem ISH is one of the fundamental results in Extremal Graph Theory. It states that 
among n-vertex graphs not containing a clique of size t the complete {t — l)-partite graph with 
(almost) equal parts has the maximum number of edges. Eor two graphs G and H we dehne the 
Turan number ex{G,H) of H in G, as the largest integer e, such that there is an H-free subgraph 
of G with e edges. Obviously ex{G,H) < |E(G)|, where E{G) denotes the edge set of G. Turan’s 
theorem, in an asymptotic form, can be restated as 

ex{Kn,Kt)= (^^3Y + o(l)) Q’ 

that is the largest Kt-free subgraph of Kn contains approximately |5f-fraction of its edges. Here 
we would like to describe an extension of this result to (n, d, A)-graphs. 

For an arbitrary graph G on n vertices it is easy to give a lower bound on ex{G,Kt) following 
Turan’s construction. One can partition the vertex set of G into t — 1 parts such that the degree 
of each vertex within its own part is at most j^-times its degree in G. Thus the subgraph 
consisting of the edges of G connecting two different parts has at least a |5f'fraction of the edges 
of G and is clearly Kt-fiee. We say that a graph (or rather a family of graphs) is t-Turdn if 
this trivial lower bound is essentially an upper bound as well. More precisely, G is t-Turan if 
ex(G,iFt) = (fEf + o(l))|E(G)|. 
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It has been shown that for any fixed t, there is a number m{t, n) such that almost all graphs 
on n vertices with m > m{t,n) edges are t-Turan (see [77], [^ for the most recent estimate for 
m{t,n)). However, these results are about random graphs and do not provide a deterministic 
sufficient condition for a graph to be t-Turan. It appears that such a condition can be obtained by 
a simple assumption about the spectrum of the graph. This was proved by Sudakov, Szabo and Vu 
in m They obtained the following result. 

Theorem 4.13 Let t > 3 be an integer and let G = {V,E) be an (n,d, X)-graph. If X = 
o{df~^/then 

ex{G,K^)={^-^ + o{l)^\E{G)\. 

Note that this theorem generalizes Turan’s theorem, as the second eigenvalue of the complete graph 
Kn is 1. 

Let us briefly discuss the sharpness of Theorem 14. 13L For t = 3, one can show that its condition 
involving n, d and A is asymptotically tight. Indeed, in this case the above theorem states that 
if d‘^/n » A, then one needs to delete about half of the edges of G to destroy all the triangles. 
On the other hand, by taking the example of Alon (Section O Example 9) whose parameters are: 
d = 0(n2/3), A = 0( and blowing it up (which means replacing each vertex by an independent 
set of size k and connecting two vertices in the new graph if and only if the corresponding vertices 
of G are connected by an edge) we get a graph G{k) with the following properties: 

\V{G{k))\ = Uk = nk; G{k) is dk = d/c-regular; G{k) is triangle-free; A(G(A:)) = kX and 

X{G{k)) = Q{dl/nk). 

The above bound for the second eigenvalue of G{k) can be obtained by using well known results on 
the eigenvalues of the tensor product of two matrices, see m for more details. This construction 
implies that for t = 3 and any sensible degree d the condition in Theorem 14. 131 is not far from being 
best possible. 

4.6 Factors and fractional factors 

Let H be a fixed graph on n vertices. We say that a graph G on n vertices has an H-factor if G 
contains n/h vertex disjoint copies of H. Of course, a trivial necessary condition for the existence 
of an H-factor in G is that h divides n. For example, if H is just an edge H = 1^2, then an H-factor 
is a perfect matching in G. 

One of the most important classes of graph embedding problems is to find sufficient conditions 
for the existence of an H-factor in a graph G, usually assuming that H is fixed while the order n 
of G grows. In many cases such conditions are formulated in terms of the minimum degree of G. 
For example, the classical result of Hajnal and SzemerMi m asserts that if the minimum degree 
(5(G) satisfies (5(G) > (1 — y)n, then G contains [n/rj vertex disjoint copies of Ky. The statement 
of this theorem is easily seen to be tight. 
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It turns our that pseudo-randomness allows in many cases to significantly weaken sufficient 
conditions for //-factors and to obtain results which fail to hold for general graphs of the same edge 
density. 

Consider first the case of a constant edge density p. In this case the celebrated Blow-up Lemma 
of Komlos, Sarkozy and Szemeredi IMl can be used to show the existence of //-factors. In order 
to formulate the Blow-up Lemma we need to introduce the notion of a super-regular pair. Given 
e > 0 and 0 < p < 1, a bipartite graph G with bipartition (Vi, V 2 ), |Vi| = IV 2 I = n, is called super 
{p, e)-regular if 

1. For all vertices v G V{G)^ 

{p — e)n < d{v) < {p + e)n ; 

2. For every pair of sets (U, W), U <zVi,W C V 2 , |//|, |VF| > en, 

e{U,W) |/;(G)| 

\U\\W\ n2 - ■ 

Theorem 4.14 EF For every choice of integers r and A and a real 0 < p < 1 there exist an e > 0 
and an integer no(e) such that the following is true. Consider an r-partite graph G with all partition 
sets Vi,... ,Vr of order n > uq and all (0 bipartite subgraphs Vj] super {p, e)-regular. Then 
for every r-partite graph H with maximum degree A(//) < A and all partition sets Xi,... ,Xr of 
order n, there exists an embedding f of H into G with each set Xi mapped onto Vi, i = 1,... ,r. 

(The above version of the Blow-up Lemma, due to Rodl and Rucihski [HI, is somewhat different 
from and yet equivalent to the original formulation of Komlos et al. We use it here as it is somewhat 
closer in spirit to the notion of pseudo-randomness). 

The Blow-up Lemma is a very powerful embedding tool. Combined with another ’’big cannon”, 
the Szemeredi Regularity Lemma, it can be used to obtain approximate versions of many of the 
most famous embedding conjectures. We suggest the reader to consult a survey of Komlos [HSI for 
more details and discussions. 

It is easy to show that if G is an (n, d, A)-graph with d = 0(n) and A = o(n), and h divides 
n, then a random partition of V{G) into h equal parts Ki,..., I 4 produces almost surely ( 2 ) super 
{d/n, e)-regular pairs. Thus the Blow-up Lemma can be applied to the obtained /i-partite subgraph 
of G and we get: 

Corollary 4.15 Let G be an {n,d, \)-graph with d = 0(n), A = o(n). If h divides n, then G 
contains an H-factor, for every fixed graph H on h vertices. 

The case of a vanishing edge density p = o(l) is as usual significantly more complicated. Here 
a sufficient condition for the existence of an //-factor should depend heavily on the graph H, as 
there may exist quite dense pseudo-random graphs without a single copy of H, see, for example, 
the Alon graph (Example 9 of Section 0) . When H = K 2 , already a very weak pseudo-randomness 
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condition suffices to guarantee an //-factor, or a perfect matching, as provided by Theorem I4.dl 
We thus consider the case H = K^, the task here is to guarantee a triangle factor, i.e. a collection 
of n/3 vertex disjoint triangles. This problem has been treated by Krivelevich, Sudakov and Szabo 
[59j who obtained the following result: 

Theorem 4.16 m Let G he an {n,d, X)-graph. If n is divisible by 3 and 

^ ° log n } ’ 

then G has a triangle factor. 

For best pseudo-random graphs with A = 0(\/d) the condition of the above theorem is fulfilled 
when d S> log^'^^ n. 

To prove Theorem 14.161 Krivelevich et al. first partition the vertex set V(G) into three parts 
^ 1 ,^ 2 , Vs of equal cardinality at random. Then they choose a perfect matching M between Vi an 
V 2 at random and form an auxiliary bipartite graph T whose parts are M and V 3 , and whose edges 
are formed by connecting e € M and u £ V 3 if both endpoints of e are connected by edges to v in 
G. The existence of a perfect matching in T is equivalent to the existence of a triangle factor in G. 
The authors of m then proceed to show that if M is chosen at random then the Hall condition is 
satisfied for T with positive probability. 

The result of Theorem 14.161 is probably not tight. In fact, the following conjecture is stated in 

m-- 

Conjecture 4.17 I53f There exists an absolute constant c> 0 so that every d-regular graph G on 
3n vertices, satisfying X{G) < cfff jn, has a triangle factor. 

If true the above conjecture would be best possible, up to a constant multiplicative factor. This is 
shown by taking the example of Alon (Section Example 9) and blowing each of its vertices by 
an independent set of size k. As we already discussed in the previous section (see also m), this 
gives a triangle-free d^-regular graph G{k) on vertices which satisfies X{G{k)) = Il(d\/nk). 

Krivelevich, Sudakov and Szabo considered in m also the fractional version of the triangle 
factor problem. Given a graph G = {V,E), denote by T = T{G) the set of all triangles of G. 
A function f : T ^ M_|_ is called a fractional triangle factor if for every v G V{G) one has 
/(O ~ V G contains a triangle factor Tq, then assigning values f{t) = 1 for all t G Tq, 
and f{t) = 0 for all other t G T produces a fractional triangle factor. This simple argument shows 
that the existence of a triangle factor in G implies the existence of a fractional triangle factor. The 
converse statement is easily seen to be invalid in general. 

The fact that a fractional triangle factor / can take non-integer values, as opposed to the 
characteristic vector of a ’’usual” (i.e. integer) triangle factor, enables to invoke the powerful 
machinery of Linear Programming to prove a much better result than Theorem I4.16L 

Theorem 4.18 Let G = {V,E) he a {n,d, X)-graph If X < O.ld'^/n then G has a fractional 
triangle factor. 
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This statement is optimal up to a constant factor - see the discussion following Conjecture 14.171 

Already for the next case H = K 4 ^ analogs of Theorem 14.161 and 1133 are not known. In fact, 
even an analog of Conjecture 14.171 is not available either, mainly due to the fact that we do not 
know the weakest possible spectral condition guaranteeing a single copy of 7^4, or Kj. in general, 
for r > 4. 

Finally it would be interesting to show that for every integer A there exist a real M and an 
integer no so that the following is true. If n > no and G is an (n, d, A)-graph for which A < d[d/n)^, 
then G contains a copy of any graph H on at most n vertices with maximum degree A (id) < A. 
This can be considered as a sparse analog of the Blow-up Lemma. 


4.7 Hamiltonicity 

A Hamilton cycle in a graph is a cycle passing through all the vertices of this graph. A graph is 
called Hamiltonian if it has at least one Hamilton cycle. For background information on Hamiltonian 
cycles the reader can consult a survey of Chvatal m- 

The notion of Hamilton cycles is one of the most central in modern Graph Theory, and many 
efforts have been devoted to obtain sufficient conditions for Hamiltonicity. The absolute majority 
of such known conditions (for example, the famous theorem of Dirac asserting that a graph on n 
vertices with minimal degree at least n/2 is Hamiltonian) deal with graphs which are fairly dense. 
Apparently there are very few sufficient conditions for the existence of a Hamilton cycle in sparse 
graphs. 

As it turns out spectral properties of graphs can supply rather powerful sufficient conditions 
for Hamiltonicity. Here is one such result, quite general and yet very simple to prove, given our 
knowledge of properties of pseudo-random graphs. 


Proposition 4.19 Let G be an {n,d, X)-graph. If 


then G is Hamiltonian. 


\2 

d-36— > 
d 


An 

d +A ’ 


Proof. According to Theorem 1231 G IS {d — 36A^/d)-vertex-connected. Also, a{G) < Xn/{d + A), 
as stated in Proposition 14.AL Finally, a theorem of Chvatal and Erdos I2ni asserts that if the vertex- 
connectivity of a graph G is at least as large as its independence number, then G is Hamiltonian. 
□ 

The Chvatal-Erdos Theorem has also been used by Thomason in izni, who proved that a {p, a)- 
jumbled graph G with minimal degree 6{G) = Ll{a/p) is Hamiltonian. His proof is quite similar in 
spirit to that of the above proposition. 

Assuming that A = o{d) and d —> oo, the condition of Proposition 14.191 reads then as: A < 
(1 — o{l))d?/n. Eor best possible pseudo-random graphs, where A = 0(\/d), this condition starts 
working when d = 
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One can however prove a much stronger asymptotical result, using more sophisticated tools for 
assuring Hamiltonicity. The authors prove such a result in EH: 

Theorem 4.20 Let G he an {n,d, X)-graph. If n is large enough and 

^ ^ (log log n)^ ^ 

“ 1000 log re(log log log n) ’ 

then G is Hamiltonian. 

The proof of Theorem 14.201 is quite involved technically. Its main instrument is the famous 
rotation-extension technique of Posa m, or rather a version of it developed by Komlos and Sze- 
meredi in EH to obtain the exact threshold for the appearance of a Hamilton cycle in the random 
graph G(n,p}. We omit the proof details here, referring the reader to EH]. 

For reasonably good pseudo-random graphs, in which A < 4^“*^ for some e > 0, Theorem oni 
starts working already when the degree d is only polylogarithmic in n - quite a progress compared 
to the easy Pronosition 14.1 hi It is possible though that an even stronger result is true as given by 
the following conjecture: 

Conjecture 4.21 There exists a positive eonstant G such that for large enough n, any (n, d, A)- 
graph that satisfies d/X> C contains a Hamilton cycle. 

This conjecture is closely related to another well known problem on Hamiltonicity. The tough¬ 
ness t{G) of a graph G is the largest real t so that for every positive integer x >2 one should delete 
at least tx vertices from G in order to get an induced subgraph of it with at least x connected 
components. G is t-tough if t{G) > t. This parameter was introduced by Chvatal in m, where 
he observed that Hamiltonian graphs are 1-tough and conjectured that t-tough graphs are Hamil¬ 
tonian for large enough t. Alon showed in [1] that if G is an (n, d, A)-graph, then the toughness of 
G satisfies t{G) > I}{d/X). Therefore the conjecture of Chvatal implies the above conjecture. 

Krivelevich and .Sudakov used Theorem 14.201 in EHj to derive Hamiltonicity of sparse random 
Cayley graphs. Given a group G of order n, choose a set S oi s non-identity elements uniformly at 
random and form a Cayley graph r(G', S U (see Example 8 in Section 3 for the definition of 
a Cayley graph). The question is how large should be the value of t = t{n) so as to guarantee the 
almost sure Hamiltonicity of the random Cayley graph no matter which group G we started with. 

Theorem 4.22 Let G be a group of order n. Then for every c > 0 and large enough n a 
Cayley graph X{G,S U 5“^), formed by choosing a set S o/clog^n random generators in G, is 
almost surely Hamiltonian. 

Sketch of proof. Let A be the second largest by absolute value eigenvalue of X{G,S). Note 
that the Cayley graph X{G,S) is d-regular for d > clog^n. Therefore to prove Hamiltonicity of 
X{G,S), by Theorem 14.201 it is enough to show that almost surely X/d < O(logn). This can be 
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done by applying an approach of Alon and Roichman uni for bounding the second eigenvalue of a 
random Cayley graph. □ 

We note that a well known conjecture claims that every connected Cayley graph is Hamiltonian. 
If true the conjecture would easily imply that as few as O(logn) random generators are enough to 
give almost sure connectivity and thus Hamiltonicity. 

4.8 Random subgraphs of pseudo-random graphs 

There is a clear tendency in recent years to study random graphs different from the classical by 
now model G{n,p) of binomial random graphs. One of the most natural models for random graphs, 
directly generalizing G{n,p), is defined as follows. Let G = {V,E) be a graph and let 0 < p < 1. 
The random subgraph Gp if formed by choosing every edge of G independently and with probability 
p. Thus, when G is the complete graph Kn we get back the probability space G{n,p). In many cases 
the obtained random graph Gp has many interesting and peculiar features, sometimes reminiscent 
of those of G{n,p), and sometimes inherited from those of the host graph G. 

In this subsection we report on various results obtained on random subgraphs of pseudo-random 
graphs. While studying this subject, we study in fact not a single probability space, but rather a 
family of probability spaces, having many common features, guaranteed by those of pseudo-random 
graphs. Although several results have already been achieved in this direction, overall it is much less 
developed than the study of binomial random graphs G{n,p), and one can certainly expect many 
new results on this topic to appear in the future. 

We start with Hamiltonicity of random subgraphs of pseudo-random graphs. As we learned 
in the previous section spectral condition are in many cases sufficient to guarantee Hamiltonicity. 
Suppose then that a host graph G is a Hamiltonian (n, d, A)-graph. How small can the edge 
probability p = p{n) be chosen so as to guarantee almost sure Hamiltonicity of the random subgraph 
Gp? This question has been studied by Frieze and the first author in m- They obtained the 
following result. 

Theorem 4.23 \4^ Let G be an {n,d, X)-graph. Assume that X = o ^ „ 3 / 2 pog^^) 3/2 ^ • Form a 
random subgraph Gp of G by ehoosing eaeh edge of G independently with probability p. Then for 
any funetion Lo(n) tending to infinity arbitrarily slowly: 

1. if p{n) = ^(logn -|- log log n — oj{n)), then Gp is almost surely not Hamiltonian; 

2. if p{n) = ^(logn -|- loglogn -|-a;(n)), then Gp is almost surely Hamiltonian. 

Just as in the case of G{n,p) (see, e.g. j^O]) it is quite easy to predict the critical probability 
for the appearance of a Hamilton cycle in Gp. An obvious obstacle for its existence is a vertex 

of degree at most one. If such a vertex almost surely exists in Gp, then Gp is almost surely non- 

Hamiltonian. It is a straightforward exercise to show that the smaller probability in the statement 
of Theorem gives the almost sure existence of such a vertex. The larger probability can be 
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shown to be sufficient to eliminate almost surely all vertices of degree at most one in Gp. Proving 
that this is sufficient for almost sure Hamiltonicity is much harder. Again as in the case of G{n,p) 
the rotation-extension technique of Posa m comes to our rescue. We omit technical details of the 
proof of Theorem I4.2d[ referring the reader to j32] • 

One of the most important events in the study of random graphs was the discovery of the sudden 
appearance of the giant component by Erdos and Renyi m- They proved that all connected 
components of G{n, c/n) with 0 < c < 1 are almost surely trees or unicyclic and have size O(logn). 
On the other hand, if c > 1, then G{n,c/n) contains almost surely a unique component of size 
linear in n (the so called giant component), while all other components are at most logarithmic in 
size. Thus, the random graph G{n,p) experiences the so called phase transition at p = Ijn. 

Very recently Frieze, Krivelevich and Martin showed ESI that a very similar behavior holds for 
random subgraphs of many pseudo-random graphs. To formulate their result, for a > 1 we define 
d < 1 to be the unique solution (other than a) of the equation xe~^ = ae“". 

Theorem 4.24 m Let G he an {n,d,X)-graph. Assume that A = o{d). Consider the random 
subgraph Ga/d, formed by choosing each edge of G independently and with probability p = a/d. 
Then: 

(a) //a < 1 then almost surely the maximum component size is O(logn). 

(b) If a > 1 then almost surely there is a unique giant component of asymptotic size (l — n 
and the remaining components are of size O(logn). 

Let us outline briefly the proof of Theorem 14.241 First, bound Q and known estimates on the 
number of fc-vertex trees in d-regular graphs are used to get estimates on the expectation of the 
number of connected components of size k in Gp, for various values of k. Using these estimates 
it is proved then that almost surely Gp has no connected components of size between (l/a 7 )logn 
and yn for a properly chosen 7 = 7 (a). Define f{a) to be 1 for all a < 1, and to be a/a for 
a > 1. One can show then that almost surely in G^/d the number of vertices in components of size 
between 1 and df/^ is equal to nf{a) up to the error term which is 0(n®/® logn). This is done by 
first calculating the expectation of the last quantity, which is asymptotically equal to nf{a), and 
then by applying the Azuma-Hoeffding martingale inequality. 

Given the above, the proof of Theorem 14.241 is straightforward. For the case a < 1 we have 
nf{a) = n and therefore all but at most n^/®logn vertices lie in components of size at most 
(l/a 7 )logn. The remaining vertices should be in components of size at least yn, but there is no 
room for such components. If a > 1, then {a/a)n + logn) vertices belong to components of 

size at most (l/a 7 )logn, and all remaining vertices are in components of size at least yn. These 
components are easily shown to merge quickly into one giant component of a linear size. The detail 
can be found in m (see also [ 7 ] for some related results). 

One of the recent most popular subjects in the study of random graphs is proving sharpness of 
thresholds for various combinatorial properties. This direction of research was spurred by a powerful 
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theorem of Priedgut-Bourgain m, providing a sufficient condition for the sharpness of a threshold. 
The authors together with Vu apply this theorem in |60j to show sharpness of graph connectivity, 
sometimes also called network reliability, in random subgraphs of a wide class of graphs. Here are 
the relevant definitions. For a connected graph G and edge probability p denote by f{p) = f{G,p) 
the probability that a random subgraph Gp is connected. The function f[p) can be easily shown to 
be strictly monotone. For a fixed positive constant x < 1 and a graph G, let px denote the (unique) 
value of p where f{G,px) = x. We say that a family of graphs satisfies the sharp threshold 

property if for any fixed positive e < 1/2 


lim 


p,{Gi) 

°° Pl-eiGi) 


1 . 


Thus the threshold for connectivity is sharp if the width of the transition interval is negligible 
compared to the critical probability. Krivelevich, Sudakov and Vu proved in m the following 
theorem. 


Theorem 4.25 f6YH / Let {Gi)^i be a family of distinct graphs, where Gi has n* vertices, maximum 
degree di and it is ki-edge-connected. If 

, ki In m 
lim = oo, 

i—>oo di 

then the family (Gi)^^ has a sharp connectivity threshold. 

The above theorem extends a celebrated result of Margulis m on network reliability (Margulis’ 
result applies to the case where the critical probability is a constant). 

Since (n, d, A) graphs are 4(1 — o(l))-connected as long as A = o(4) by Theorem 14.IL we imme¬ 
diately get the following result on the sharpness of the connectivity threshold for pseudo-random 
graphs. 

Corollary 4.26 Let G be an [n, d, X)-graph. If \ = o{d), then the threshold for connectivity in the 
random subgraph Gp is sharp. 

Thus already weak connectivity is sufficient to guarantee sharpness of the threshold. This result 
has potential practical applications as discussed in M- 

Finally we consider a different probability space created from a graph G = {V,E). This space 
is obtained by putting random weights on the edges of G independently. One can then ask about 
the behavior of optimal solutions for various combinatorial optimization problems. 

Beveridge, Frieze and McDiarmid treated in m the problem of estimating the weight of a 
random minimum length spanning tree in regular graphs. For each edge e of a connected graph 
G = {V,E) define the length of e to be a random variable uniformly distributed in the interval 
(0,1), where all Xf. are independent. Let mst{G,X) denote the minimum length of a spanning 
tree in such a graph, and let mst{G) be the expected value of mst{G, X). Of course, the value of 
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mst{G) depends on the connectivity structure of the graph G. Beveridge et al. were able to prove 
however that if the graph G is assumed to be almost regular and has a modest edge expansion, 
then mst{G) can be calculated asymptotically: 

Theorem 4.27 m Let a = a{d) = 0{d and let p{d) and oj{d) tend to infinity with d. 
Suppose that the graph G = {V, E) satisfies 

d < d{v) < (1 + a)d for all v e V{G ), 

and 

^ ^ ^ > ood"^^^ logd for all S C V with d/2 < IS"! < min{pd, |l^|/2} . 

Then 

mst{G) = (1 + o(l))^C(3) , 

where the o(l) term tends to 0 as d ^ oo, and C(3) = ~ 1.202.... 

The above theorem extends a celebrated result of Frieze EOl, who proved it in the case of the 
complete graph G = Kn- 

Pseudo-random graphs supply easily the degree of edge expansion required by Theorem 14.271 
We thus get: 

Corollary 4.28 Let G be an {n,d, X)-graph. If X = o{d) then 

71 

mst{G) = {l + o(l))-C(3) . 

Beveridge, Frieze and McDiarmid also proved that the random variable mst{G, X) is sharply 
concentrated around its mean given by Theorem 14.271 

Comparing between the very well developed research of binomial random graphs G{n,p) and few 
currently available results on random subgraphs of pseudo-random graphs, we can say that many 
interesting problems in the latter subject are yet to be addressed, such as the asymptotic behavior 
of the independence number and the chromatic number, connectivity, existence of matchings and 
factors, spectral properties, to mention just a few. 

4.9 Enumerative aspects 

Pseudo-random graphs on n vertices with edge density p are quite similar in many aspects to the 
random graph G{n,p). One can thus expect that counting statistics in pseudo-random graphs will 
be close to those in truly random graphs of the same density. As the random graph G{n,p) is 
a product probability space in which each edge behaves independently, computing the expected 
number of most subgraphs in G{n,p) is straightforward. Here are just a few examples: 

• The expected number of perfect matchings in G{n,p) is (assuming of course 

that n is even); 
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• The expected number of spanning trees in G{n,p) is 

• The expected number of Hamilton cycles in G{n,p) is 

In certain cases it is possible to prove that the actual number of subgraphs in a pseudo-random 
graph on n vertices with edge density p = p{n) is close to the corresponding expected value in the 
binomial random graph G{n,p). 

Frieze in m gave estimates on the number of perfect matchings and Hamilton cycles in what 
he calls super e-regular graphs. Let G = (H, E) be a graph on n vertices with (^)p edges, where 
0 < p < 1 is a constant. Then G is called super {p, e)-regular, for a constant e > 0, if 

1. For all vertices v G H(G), 

{p — e)n < d{v) < {p + e)n ; 


2. For all U,W cV,UnW 


0, \U\,\W\>en, 


e{U, W) 
~\U\\W[ 


< e . 


Thus, a super (p, e)-regular graph G can be considered a non-bipartite analog of the notion of 
a super-regular pair defined above. In our terminology, G is a weakly pseudo-random graph of 
constant density p, in which all degrees are asymptotically equal to pn. Assume that n = 2z^ is 
even. Let m{G) denote the number of perfect matchings in G and let h{G) denote the number of 
Hamilton cycles in G, and let t{G) denote the number of spanning trees in G. 


Theorem 4.29 EF If € is sufficiently small and n is sufficiently large then 

(a) 


(b) 




(p - 2e)"n! < h{G) < (p + 2e)"n! 


Theorem thus implies that the numbers of perfect matchings and of Hamilton cycles in 
super e-regular graphs are quite close asymptotically to the expected values of the corresponding 
quantities in the random graph G{n,p). Part (b) of Theorem 14. 291 improves significantly Corollary 
2.9 of Thomason 1121 which estimates from below the number of Hamilton cycles in jumbled graphs. 

Here is a very brief sketch of the proof of Theorem 14.291 To estimate the number of perfect 
matchings in G, Frieze takes a random partition of the vertices of G into two equal parts A and 
B and estimates the number of perfect matchings in the bipartite subgraph of G between A and 
B. This bipartite graph is almost surely super 2e-regular, which allows to apply bounds previously 
obtained by Alon, Rodl and Ruciiiski m for such graphs. 

Since each Hamilton cycle is a union of two perfect matchings, it follows immediately that 
h{G) < m'^{G)f2, establishing the desired upper bound on h{G). In order to prove a lower bound, 
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let /fc be the number of 2-factors in G containing exactly k cycles, so that fi = h{G). Let also A 
be the number of ordered pairs of edge disjoint perfect matchings in G. Then 

[n/3\ 

A=Y, • (13) 

i=l 

For a perfect matching M in G let qm be the number of perfect matchings of G disjoint from M. 
Since deleting M disturbs e-regularity of G only marginally, one can use part (a) of the theorem to 
get aM> {p- 2eY-;^. Thus 

A= (14) 

MeG ^ ^ 

Next Frieze shows that the ratio fk+i/fk can be bounded by a polynomial in n for all 1 < A; < A:i = 
fk < max{/fcg+i,for all k > ko + 2,ko = 0(p“^logn) and that the ratio 

ifk . • +/[n/ 3 j)//fci is also bounded by a polynomial in n. Then from (HTni . ^<Opii)Ek=ifk 

and thus A < Plugging (|T1|) we get the desired lower bound. 

One can also show (see PP) that the number of spanning trees t{G) in super (p, e)-regular graphs 
satishes: 

(p - 2e)^-^n^-^ < t{G) < [p + 2e)'^-^n^-‘^ , 

for small enough e > 0 and large enough n. In order to estimate from below the number of spanning 
trees in G, consider a random mapping / : V{G) —> V{G), defined by choosing for each u G P its 
neighbor f{v) at random. Each such / defines a digraph Df = {V,Af), Af = {{v,f{v)) : v G V}. 
Each component of Dj consists of cycle G with a rooted forest whose roots are all in G. Suppose that 
Df has kf components. Then a spanning tree of G can be obtained by deleting the lexicographically 
first edge of each cycle in Df, and then extending the kf components to a spanning tree. Showing 
that Df has typically 0{^/n) components implies that most of the mappings / create a digraph 
close to a spanning tree of G, and therefore; 

t{G) > : P ^ P| > - e)n” . 

For the upper bound on t{G) let fl* = {f : V ^ V : {v,f{v)) G E{G) for u / 1 and /(I) = 1}. 
Then 

t{G) < |Q*| < {{p + e)n)'"~^ < (p + 2e)'^“^n”“2 . 

To see this consider the following injection from the spanning trees of G into f}*: orient each edge 
of a tree T towards vertex 1 and set /(I) = 1. Note that this proof does not use the fact that the 
graph is pseudo-random. Surprisingly it shows that all nearly regular connected graphs with the 
same density have approximately the same number of spanning trees. 

For sparse pseudo-random graphs one can use Theorem l4.2dl to estimate the number of Hamilton 
cycles. Let G be an (re, d, A)-graph satisfying the conditions of Theorem I4.2dl Consider the random 
subgraph Gp of G, where p = (log re -|- 21oglogre)/d. Let X be the random variable counting the 


44 






number of Hamilton cycles in Gp. According to Theorem irai Gp has almost surely a Hamilton 
cycle, and therefore E[X] > 1—o(l). On the other hand, the probability that a given Hamilton cycle 
of G appears in Gp is exactly p"". Therefore the linearity of expectation implies E[X] = h{G)p'^. 
Combining the above two estimates we derive: 


h{G) > 


l-o(l) 

pn 


_ ^^ 

(1 + o(l)) logn/ 


We thus get the following corollary: 


Corollary 4.30 Let G be an {n,d, X)-graph with A = o((i^/^/(n^/^(log Then G con¬ 

tains at least ^ logn ) Hamilton cycles. 

Note that the number of Hamilton cycles in any d-regular graph on n vertices obviously does not 
exceed d^. Thus for graphs satisfying the conditions of Theorem 14.231 the above corollary provides 
an asymptotically tight estimate on the exponent of the number of Hamilton cycles. 


5 Conclusion 

Although we have made an effort to provide a systematic coverage of the current research in pseudo¬ 
random graphs, there are certainly quite a few subjects that were left outside this survey, due to 
the limitations of space and time (and of the authors’ energy). Probably the most notable omission 
is a discussion of diverse applications of pseudo-random graphs to questions from other fields, 
mostly Extremal Graph Theory, where pseudo-random graphs provide the best known bounds for 
an amazing array of problems. We hope to cover this direction in one of our future papers. Still, 
we would like to believe that this survey can be helpful in mastering various results and techniques 
pertaining to this field. Undoubtedly many more of them are bound to appear in the future and 
will make this fascinating subject even more deep, diverse and appealing. 

Acknowledgment. The authors would like to thank Noga Alon for many illuminating discussions 
and for kindly granting us his permission to present his Theorem l4.1()l here. The proofs of Theorems 
mioi were obtained in discussions with him. 
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